ABSTRACT 

This article describes the types of nongraded 
elementary schools that were prevalent from the 1950s through the 
1970s, reviews research on the academic achievement effects of 
nongraded schools, and draws inferences from this research for 
application of nongraded systems in today's schools. Research in 
which an objective measure of achievement was used and for which 
comparability of nongraded and graded samples was established was 
reviewed. The review established 5 categories of nongraded programs: 
(1) programs in which students were grouped according to performance 
in one subject; (2) programs in which students were grouped according 
to performance in several subjects; (3) programs that used 
individualized instruction; (4) individually guided education 
programs; and (5) programs that were not explicitly described in the 
research. Results indicated consistent positive achievement effects 
for students in groupings in the first two categories. Nongraded 
programs that made use of individualized instruction were less 
consistently successful. It is concluded that nongraded organization 
has a positive impact on student achievement if it allows teachers to 
provide direct instruction to students outside of the framework of 
individualized instruction. A list of 110 references and a set of 6 
tables are provided. (EC) 
The Center 



The mission of the Center for Research on Effective Schooling for Disadvantaged Students 
(CDS) is to significantly improve the education of disadvantaged students at each level of 
schooling through new knowledge and practices produced by thorough scientific study and 
evaluation. The Center conducts its research in four program areas: The Early and Elementary 
Education Program, The Middle Grades and High Schools Program, the Language Minority 
Program, and the School, Family, and Community Connections Program. 

The Early and Elementary Education Program 

This program is working to develop, evaluate, and disseminate instructional programs 
capable of bringing disadvantaged students to high levels of achievement, particularly in the 
fundamental areas of reading, writing, and mathematics. The goal is to expand the range of 
effective alternatives which schools may use under Chapter 1 and other compensatory education 
funding and to study issues of direct relevance to federal, state, and local policy on education of 
disadvantaged students. 

The Middle Grades and High Schools Program 

This program is conducting research syntheses, survey analyses, and field studies in middle 
and high schools. The three types of projects move from basic research to useful practice. 
Syntheses compile and analyze existing knowledge about effective education of disadvantaged 
students. Survey analyses identify and describe current programs, practices, and trends in middle 
and high schools, and allow studies of their effects. Field studies are conducted in collaboration 
with school staffs to develop and evaluate effective programs and practices. 

The Language Minority Program 

This program represents a collaborative effort. The University of California at Santa 
Barbara is focusing on the education of Mexican-American students in California and Texas; 
studies of dropout among children of recent immigrants are being conducted in San Diego and 
Miami by Johns Hopkins, and evaluations of learning strategies in schools serving Navajo 
Indians are being conducted by the University of Northern Arizona. The goal of the program is 
to identify, develop, and evaluate effective programs for disadvantaged Hispanic, American 
Indian, Southeast Asian, and other language minority children. 

The School, Family, and Community Connections Program 

This program is focusing on the key connections between schools and families and between 
schools and communities to build better educational programs for disadvantaged children and 
youth. Initial work is seeking to provide a research base concerning the most effective ways for 
schools to interact with and assist parents of disadvantaged students and interact with the 
community to produce effective community involvement. 
Achievement Effects of the Nongraded Elementary 
School: A Retrospective Review 



Abstract 

A nongraded elementary program is one in which children are flexibly 
grouped according to performance level, not age, and proceed through the 
elementary school at their own rates. Popular in the 1950's, '60's, and early '70's, 
the nongraded plan is returning today. This article reviews research on the 
achievement effects of nongraded organization. Results indicated consistent 
positive achievement effects of simple forms of nongrading generally developed 
early: cross-grade grouping for one subject (median ES = +.46) and cross-grade 
grouping for many subjects (median ES = +.34). Forms of nongrading making 
extensive use of individualization were less consistently successful (median 
ES = +.02). Studies of Individually Guided Education, which used nongrading 
and individualization, also produced inconsistent effects (median ES = +.11). The 
article concludes that nongraded organization can have a positive impact on student 
achievement if cross-age grouping is used to allow teachers to provide more direct 
instruction to students, but not if it is used as a framework for individualized 
instruction. 




Acknowledgments 



We would like to thank Robert H. Anderson and John Goodlad for their comments 
on an earlier version of this report. 




Introduction 



Greek mythology tells us of the 
cruel robber, Procrustes (the 
stretcher). When travelers sought 
his house for shelter, they were tied 
to an iron bedstead. If the traveler 
was shorter than the bed, 
Procrustes stretched him until he 
was the same length as the bed. If 
he was longer, his limbs were 
chopped off to make him fit. 
Procrustes shaped both short and 
tall until they were equally long and 
equally dead. 

[Graded systems of school 
organization] trap school-age 
travelers in much the same fashion 
as Procrustes' bed trapped the 
unwary. (Goodlad & Anderson, 
1959, p. 1). 

So begins the book that launched one of the 
most interesting innovations in the history of 
education: The Nongraded Elementary 
School, by John Goodlad and Robert 
Anderson. The nongraded elementary school 
movement was an important force in North 
American education in the 1960's and early 
1970's, even if its major elements were only 
implemented in a small proportion of 
schools. 

The challenge to the traditional age-graded 
classroom posed by the nongraded concept is 
one that still has relevance today, and the 
nongraded elementary school itself is 
reappearing in U.S. schools. Recently, the 
states of Kentucky and Oregon have 
promoted a shift to nongraded programs, and 
many districts and schools elsewhere are 
moving in this direction (Willis, 1991). 

A great deal of research has been done to 
evaluate various forms of the nongraded 
elementary school, but there are few 
comprehensive reviews on this topic. 
McLoughlin (1967), reviewing studies done 
up to 1966, concluded that most found no 
differences between graded and nongraded 
programs in reading, arithmetic, and 
language arts performance. In contrast, 
Pavan (1977), who limited her review of 
achievement to studies reported between 
1968 and 1976, concluded that most 
comparisons favored the nongraded plan. 

However, both of these reviews were quite 
limited. Both simply counted statistically 
significant findings favoring graded or 
nongraded programs, paying little attention to 
the particular forms of nongrading used, the 
methodological quality of the studies, or the 
size of the effects. 

The purpose of this article is to describe the 
nongraded elementary school in its earlier 
incarnations, to systematically review 
research on the academic achievement effects 
of nongraded schooling, and to draw 
inferences from this research for applications 
of the nongraded ideal in today's schools. 



What Is a Nongraded Elementary School? 



The term nongraded (or ungraded) 
elementary school covers a wide range of 
school and classroom organizational 
arrangements. Central to the concept is the 
elimination of traditional grade level 
designations. Instead, students are grouped 
according to their level of academic 
performance, not their ages. Sometimes this 
grouping is done for just one subject, 
sometimes for many subjects, and sometimes 
students are placed in self-contained multi-age 
classrooms according to reading 
performance or general ability. For example, 
a nongraded reading class might contain six-, 
seven-, and eight-year-old students, all 
reading at what would ordinarily be 
considered the second grade level. Students 
are allowed to proceed through the grades at 
their own rates. Some may take longer than 
usual to complete the elementary grades, 
while others may complete elementary school 
in less time than usual. Because a school has 
classes at many levels, a child who spurts 
ahead or falls behind can easily be moved to 
another class appropriate to his or her level. 
As a result, no child is ever retained or 
skipped a whole grade at once. 

Frequently, the nongraded program applies 
only to the primary grades (1-3 or K-3), and 
is called a nongraded primary school. This is 
the main form that is returning today (Willis, 
1991). The idea is that all students will have 
a certain level of academic performance on 
entering fourth grade, but may have taken 
more or less time to reach this level. 

In their original conception, nongraded 
elementary schools usually incorporated a 
curriculum structure called "continuous 
progress," in which the skills to be learned in 
such subjects as reading and mathematics are 
organized into a hierarchical series of levels 
covering all the grades involved in the plan 
(usually 1-3 or 1-6). For example, the 
reading curriculum ordinarily taught in grades 
1-3 might be organized in four levels per 
grade, for a total of twelve levels for the 
entire nongraded period leading to grade 
four. 

In a continuous-progress model, students 
pick up each year where they left off the 
previous year. For example, a low achiever 
who has only completed Level 5 at the end of 
his second year would start with Level 6 at 
the beginning of his third year (rather than 
being retained in second grade, as might 
occur in a graded school). A high achiever 
who mastered Level 12 at the end of her 
second year might simply go to fourth grade 
a year early. 
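
The placement rule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the four-levels-per-grade, twelve-level structure comes from the example in the text, while the function name `next_step` and the returned wording are hypothetical and are not drawn from any actual nongraded program.

```python
# Illustrative sketch of continuous-progress placement.
# Assumption: 4 levels per grade across grades 1-3, as in the text's example.
LEVELS_PER_GRADE = 4
GRADES_IN_PLAN = 3
TOTAL_LEVELS = LEVELS_PER_GRADE * GRADES_IN_PLAN  # 12 levels in all


def next_step(last_level_completed):
    """Return where a student resumes at the start of the next year.

    Hypothetical helper: a student picks up at the level after the last
    one completed, and leaves the nongraded unit once every level is done.
    """
    if last_level_completed >= TOTAL_LEVELS:
        return "enter fourth grade"
    return f"start at Level {last_level_completed + 1}"


print(next_step(5))   # the low achiever in the text's example
print(next_step(12))  # the high achiever in the text's example
```

The point of the sketch is that the number of years spent in the unit never enters the rule; only the last level completed does, which is why no student repeats content already mastered.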

Goodlad and Anderson (1963) recommended 
having such a hierarchical structure only for 
hierarchical subjects, such as reading and 
math, and using interest groupings, age 
groupings, or other criteria for subjects (such 
as social studies and science) which depend 
to a lesser degree on prerequisite skills. 

Beyond the use of flexible, multi-age 
groupings, actual operationalizations of the 
nongraded elementary school model have 
varied enormously. At one end of a 
continuum of complexity, nongraded 
organization is essentially equivalent to the 
Joplin Plan (Floyd, 1954; Slavin, 1987). 
This is an arrangement in which students are 
grouped across grade lines for just one 
subject, almost always reading. For 
example, at a common reading period all 
students might move to a class composed of 
students at the same performance level in 
reading drawn from different classes and 
grade levels; a second grade, first semester 
reading class might have first, second, and 
third graders in it. Students move through a 
continuous-progress sequence of reading 
levels which cover the material students are 
expected to learn in all grades involved in the 
plan. They move as rapidly as they are able 
to go, taking as much time as they need to 
master the material. Groupings are 
frequently reassessed and changed if student 
performance warrants it. 

The main effect of the use of the Joplin Plan 
is to reduce the number of reading groups 
taught by each teacher, often to one (i.e., 
whole-class instruction), thereby reducing the 
difficulties inherent in managing multiple 
groups and reducing the need for students to 
do follow-up activities independently of the 
teacher. 

The Joplin Plan can be described as a 
nongraded reading program which still 
maintains an age-graded organization for 
other subjects. Studies of the Joplin Plan, 
which was popular in the 1950's and '60's, 
do not make it clear what happened when 
students reached the end of the elementary 
grades and were reading at a level quite 
different from their grade level. 

In the 1960's, nongraded programs began to 
more closely resemble the model described 
by Goodlad and Anderson (1963), which 
suggested flexible multi-age grouping for 
most or all academic subjects, with 
continuous-progress curricula for such 
subjects as reading and mathematics. 

When it was first described and implemented 
in the 1950's and early '60's, nongraded 
organization primarily involved changes in 
grouping patterns, not instructional methods. 
Teachers in the earlier implementations still 
overwhelmingly taught students in groups 
using traditional methods and curricula. 

Starting in the late 1960's, however, the 
nongraded plan often absorbed another 
innovation becoming popular at that time, 
individualized instruction. Increasingly, 
descriptions of nongraded schools began to 
include the extensive use of learning stations, 
learning activity packets, and other 
individualized, student-directed activities. In 
many cases these individual activities were 
also combined with tasks students completed 
in small groups which primarily worked 
independently of the teacher. 

Another typical attribute of these forms of 
nongrading was team teaching. For example, 
two to six teachers might occupy a section of 
the school and take joint responsibility for a 
large group of students, flexibly grouping 
and regrouping them throughout the day. As 
time went on, programs of this kind were 
increasingly implemented in schools without 
classroom walls, and tended to be called open 
schools rather than nongraded elementary 
schools (see Giaconia & Hedges, 1982). In 
an introduction to the 1987 reprinting of their 
1963 book, Goodlad and Anderson 
acknowledge the essential commonality 
between the two approaches. 

A good summary of the goals and elements 
of a fully realized operationalization of the 
nongraded ideal is adapted by Goodlad and 
Anderson (1987, pp. xv-xviii) from the 
dissertation of Barbara Pavan (1972), who 
was the principal of one of the earliest model 
nongraded elementary schools and continues 
to be an important advocate of this approach. 
It is presented below. 



I. Goals of Schooling 

1. The ultimate school goal is to 
develop self-directing autonomous 
individuals. 

2. The school should help develop 
individual potentialities to the maximum 
possible. 

3. Each individual is unique and is 
accorded dignity and respect. 
Differences in people are valued. 
Therefore the school should strive to 
increase the variability of individual 
differences rather than stress 
conformity. 

4. Development of the child must be 
considered in all areas: aesthetic, 
physical, emotional, and social, as well 
as intellectual. 

5. Those involved in the school 
enterprise are co-learners, especially 
teachers and students. 

6. The school atmosphere should 
allow children to enjoy learning, to 
experience work as pleasurable and 
rewarding, and to be content with 
themselves. 

II. Administrative-Organizational 
Framework 

A. Vertical Grouping 

7. Each individual works in varied 
situations where he or she will have 
opportunities for maximum progress. 
There are no procedures for retention or 
promotion, nor any grade levels. 

8. A child's placement may be 
changed at any time if it is felt to be in 
the best interests of the child's 
development considering all five phases 
of development: aesthetic, physical, 
intellectual, emotional, and social. 



B. Horizontal Grouping 

9. Grouping and subgrouping patterns 
are extremely flexible. Learners are 
grouped and regrouped on the basis of 
one specific task and are disbanded 
when that objective is reached. 

10. Each child should have 
opportunities to work with groups of 
many sizes, including one-person 
groups, formed for different purposes. 

11. The specific task, materials 
required, and student needs determine 
the number of students that may be 
profitably engaged in any given 
educational experience. 

12. Children should have frequent 
contact with children and adults of 
varying personalities, backgrounds, 
abilities, interests, and ages. 

III. Operational Elements 

A. Teaching Materials-Instructional 

13. A wide variety of textbooks, trade 
books, supplemental materials, 
workbooks, and teaching aids must be 
available and readily accessible in 
sufficient quantities. 

14. Varied materials must be available 
to cover a wide range of reading 
abilities. 

15. Alternate methods and materials 
will be available at any time so that the 
child may use the learning style and 
materials most suitable to his or her 
present needs and the task at hand 
(including skill building, self-teaching, 
self-testing, and sequenced materials). 

16. A child is not really free to learn 
something she or he has not been 
exposed to. The teacher is responsible 
for providing a broad range of 
experiences and materials that will 
stimulate many interests in the 
educational environment. 



B. Curriculum (knowledge) 

17. The unique needs, interests, 
abilities, and learning rates, styles and 
patterns of each child will determine his 
or her individual curriculum. 
Conformity and rigidity are not 
demanded. 

18. The curriculum should be 
organized to develop the understanding 
of concepts and methods of inquiry 
more than specific content learning. 

19. Process goals will be stressed: 
the development of the skills of inquiry, 
evaluation, interpretation, application - 
"the skills of learning to learn." 

20. Sequence of learning must be 
determined by each individual student 
and his or her teacher, since: 

(a) no logical or inherent sequence is 
in the various curriculum areas. 

(b) no predetermined sequence is 
appropriate to all learners. 

(c) individual differences in level of 
competence and in interest are 
constantly in flux. 

21. Each child will formulate his or her 
own learning goals with guidance from 
his or her teachers. 

C. Teaching Methods 

22. Different people learn in different 
ways. 

23. Learning is the result of the 
student's interaction with the world she 
or he inhabits. Individuals learn by 
direct experience and manipulation of 
their environment: therefore the child 
must be allowed to explore, to 
experiment, to "mess around," to play, 
and have the freedom to err. 

24. The process is more important 
than the product. How the child learns 
is stressed. 
25. All phases of human growth- 
aesthetic, physical, intellectual, 
emotional, and social-are considered 
when planning learning experiences for 
a child. 

26. The teacher is a facilitator of 
learning. She or he aids in the child's 
development by helping each one to 
formulate goals, diagnose problem 
areas, suggest alternative plans of 
action; provides resource materials and 
gives encouragement, support, or 
prodding as needed. 

27. Children should work on the level 
appropriate to present attainment and 
should move as quickly as their abilities 
and desires allow them to. 

28. Successful completion of 
challenging experiences promotes 
greater confidence and motivation to 
learn than fear of failure. 

29. Learning experiences based on the 
child's expressed interests will motivate 
the child to continue and complete a 
task successfully much more frequently 
than teacher-contrived techniques. 

D. Evaluation and Reporting 

30. Children are evaluated in terms of 
their past achievements and their own 
potential, not by comparison to group 
norms. Expectations differ for different 
children. 

31. Evaluation by teacher and/or child 
is done for diagnostic purposes and 
results in the formulation of new 
education objectives. 

32. Evaluation must be continuous and 
comprehensive to fulfill its diagnostic 
purpose. 

33. A child strives mainly to improve 
his or her performance and develop 
potential rather than to compete with 
others. 

34. Teachers accept and respond to the 
fact that growth patterns will be 
irregular and will occur in different areas 
at different times. 

35. Individual pupil progress forms are 
used to record the learning tasks 
completed, deficiencies that need new 
assignments to permit mastery, and all 
other data that will show the child's 
progress in relation to past 
achievements and potential or that will 
help the teacher in suggesting possible 
future learning experiences for the 
individual. 

36. Evaluating and reporting will 
consider all five areas of the child's 
development: aesthetic, physical, 
intellectual, emotional, and social. 

Rationales for Nongrading 

The major rationale for a nongraded approach 
is to provide an alternative to both retention 
and social promotion (i.e., promoting 
students regardless of performance). In the 
view of Goodlad and Anderson (1963) and 
many who followed them (e.g., Shepard & 
Smith, 1989), retention is harmful to 
students, is applied inconsistently, and fails 
to take into account developmental 
inconsistencies (e.g., "late bloomers"), 
especially among young children. 

A retained child repeats a whole year of 
content he or she failed to learn the first time. 
Spending a year failing to learn a body of 
curriculum and then spending a second year 
going over the same curriculum seems to be a 
poor practice for low achievers. Advocates 
of nongrading would argue that it is far better 
to allow such students to move more slowly 
through material with a high success rate and 
never have to repeat unlearned content. As 
noted earlier, nongraded elementary 
programs use a "continuous progress" plan, 
in which a hierarchical curriculum (such as 
reading or mathematics) is divided into some 
number of units across the grades, and then 
students can take as much time as they need 
to complete the units. 

A low achiever moving slowly through a 
continuous progress curriculum may take as 
many years to reach the fourth grade as a 
similar low achiever in a graded structure 
who is retained at some point in grades 1-3, 
but advocates of nongrading would argue that 
the continuous progress plan is less 
stigmatizing, less psychologically damaging, 
and more instructionally sensible than 
retention. 

Nongraded organization also offers an 
alternative to traditional forms of ability 
grouping. Goodlad and Anderson (1963) 
point out how nongrading can be an 
improvement on both between-class ability 
grouping (e.g., high, middle, and low self-contained 
second grades) and within-class 
ability grouping (e.g., reading groups). The 
problem with between-class ability grouping, 
they argue, is that grouping on any one 
criterion (such as reading performance or 
general ability) cannot group students well 
for any particular skill. For example, a class 
grouped according to reading skill will have a 
very broad range of math levels, and will 
even be quite diverse in performance levels 
on any particular reading task. As a result, 
the costs of ability grouping in terms of 
stigmatizing low achievers are not 
compensated for by any practically 
meaningful reduction in heterogeneity. 

Formation of reading groups within 
heterogeneous classes is similarly flawed in 
their view. In order to create homogeneous 
groups, teachers must have many reading 
groups and therefore much class time must be 
spent on follow-up activities of little 
instructional value. 

In the nongraded plan, students are flexibly 
grouped for major subjects (especially 
reading and math) across class and age lines, 
so that the resulting groups are truly 
homogeneous on the skills being taught. 
Further, by creating multi-age groups from 
among all students in contiguous grade 
levels, it is possible for teachers to create 
entire reading or math classes at one or, at 
most, two levels, so that they need not devote 
much class time to follow-up. 

Finally, the nongraded plan is proposed as a 
solution to the problem of split grades. In 
many schools with, for example, a class size 
of 25 and 38 students in each of grades two 
and three, principals would create one second 
grade class, one third grade class, and a 2-3 
combination class. In a graded structure 
teaching the 2-3 class is difficult, as the two 
portions of the class may be taught 
completely separately. A nongraded 
organization, by eliminating the designation 
of students as second or third graders, solves 
this problem. 

The rationale for the re-emergence of the 
nongraded plan today is similar to that of the 
1950's. In the 1980's, retention rates 
increased dramatically in elementary schools, 
especially those in large cities (Levine & 
Eubanks, 1986-87). This was partly a result 
of accountability pressures, which focus on 
performance of students according to grade 
level, not age, thereby rewarding districts for 
such policies as imposing grade-to-grade 
promotion standards and holding back low 
achievers (see McGill-Franzen & Allington, 
1991; Slavin & Madden, 1991). 

However, in more recent years a reaction 
against high retention rates has taken place, 
influenced in particular by the work of 
Shepard and Smith (1989) documenting the 
negative long-term effects of retention in the 
elementary grades. Unwilling to return to 
social promotion (and still under 
accountability pressures which discourage it), 
many school districts are currently 
experimenting with a variety of means of 
holding standards constant while allowing 
time spent in the early grades to vary. 
Among these is the growing use of means of 
adding a year between kindergarten and 
second grade for at-risk children, such as 
developmental kindergarten, junior 
kindergarten, transitional first grade, or pre-first 
programs. However, research on the 
long-term impacts of these approaches has 
questioned their value (see Karweit, in 
press). The nongraded primary has been 
rediscovered as a means of avoiding both 
retention and social promotion, just as it was 
in the 1950's. 

Another rationale for the nongraded primary 
school still important today is a reaction 
against traditional ability grouping. Between- 
class ability grouping (e.g., high, middle, 
and low second grades) has been used by a 
minority of elementary schools, but use of 
reading groups has been almost universal 
until very recently (McPartland, Coldiron, 
and Braddock, 1987). At present, many 
schools are seeking alternatives to the use of 
set reading groups (see Barr, 1990) and the 
nongraded program appears to be a means of 
doing away with reading groups while still 
allowing teachers to accommodate instruction 
to individual needs. 

An important factor today in the move toward 
the nongraded primary that was not a 
rationale in the 1950's is the trend toward 
"developmentally appropriate" practices in the 
early grades. By this is meant instructional 
approaches which allow young children to 
develop skills at their own pace. For 
example, the National Association for the 
Education of Young Children (1989) 
published a position statement entitled 
Appropriate Education in the Primary Grades 
which described developmentally appropriate 
education for children ages 5-8. Among the 
prescriptions were the following: 

Each child is viewed as a unique 
person with an individual pattern 
and timing of growth... Children 
are allowed to move at their own 
pace in acquiring important 
skills... For example, it is 
accepted that not every child will 
learn how to read at age 6, most 
will learn by 7, and some will 
need intensive exposure to 
appropriate literacy experiences to 
learn to read by 8 or 9. (p. 4). 

The NAEYC position paper also supported 
integrated curriculum and instruction, 
extensive use of projects and learning 
stations, cooperative learning, and other 
strategies quite consistent with the nongraded 
primary plans of the late 1960's and early 
'70's (and with the open classroom of the 
same period). 



Individually Guided Education 

One important outgrowth of the nongraded 
concept was Individually Guided Education 
(IGE), developed and researched by 
Klausmeier and his colleagues at the 
University of Wisconsin (Klausmeier, 
Rossmiller, & Saily, 1977). IGE, in its 
Wisconsin version or in the one developed by 
the Kettering Foundation (through I/D/E/A), 
was a very ambitious, comprehensive 
restructuring of elementary education. It 
used a nongraded grouping strategy, in 
which students were flexibly grouped 
according to instructional needs rather than 
age. 

As in any nongraded elementary school, 
students could take as much time as they 
needed to complete the objectives prescribed 
for each subject. However, IGE affected all 
aspects of school organization and 
instruction, not only grouping. Individual 
plans were prepared for each student, and 
students were constantly assessed to 
determine their continuing placements. 
Instruction could be delivered one-on-one by 
teachers or peers, to small groups, or (rarely) 
to large groups. 

Extensive use was made of learning stations 
at which students could perform experiments, 
work on individualized units, or do other 
individual or small group activities 
independently of the teacher. Comprehensive 
instructional models were developed and 
implemented in reading, mathematics, social 
studies, and science. Students were 
organized into multi-age Instruction and 
Research (I & R) units of 100 to 150 students 
with (ideally) a "unit leader," three to five 
"staff teachers," an aide, and a teacher intern. 
This team planned and carried out the 
instruction students received in all subjects. 
Often, individual teachers would become 
experts in a given subject and take 
responsibility for that subject with the entire 
unit. A building-level Instructional 
Improvement Committee worked to establish 
objectives and policies for the school as a 
whole. 






Review Methods 



This review synthesizes the findings of 
research comparing the achievement effects 
of nongraded and traditional organizations in 
the elementary grades (K-6). The review 
method used is best-evidence synthesis 
(Slavin, 1986), which combines elements of 
meta-analysis (Glass, McGaw, & Smith, 
1981) with those of narrative reviews. 
Briefly, a best-evidence synthesis requires 
locating all research on a given topic, 
establishing well-specified criteria of 
methodological adequacy and germaneness to 
the topic, and then reviewing this "best 
evidence" with attention to the substantive 
and methodological contributions of each 
study. Whenever possible, study outcomes 
are characterized in terms of effect sizes, the 
difference between the experimental and 
control means divided by the control group's 
standard deviation. Details of the review 
procedures are described in the following 
sections. 
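
As a rough illustration of the effect size defined above, the calculation can be sketched as follows. The scores are hypothetical and are not taken from any study in this review, the function name `effect_size` is invented for the example, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the definition above does not specify which form was used.

```python
from statistics import mean, stdev


def effect_size(experimental, control):
    """Difference between the experimental and control means,
    divided by the control group's standard deviation.

    Assumption: stdev() is the sample standard deviation (n - 1).
    """
    return (mean(experimental) - mean(control)) / stdev(control)


# Hypothetical achievement scores, for illustration only.
nongraded_scores = [45, 55, 62]  # mean 54
graded_scores = [40, 50, 60]     # mean 50, sample SD 10

es = effect_size(nongraded_scores, graded_scores)
print(f"ES = {es:+.2f}")
```

With these made-up scores the nongraded mean exceeds the graded mean by four points, or four tenths of the graded group's standard deviation. Expressing outcomes in control-group standard deviation units is what allows results from different achievement tests to be compared and summarized as medians.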

Literature Search Procedures 

Every effort was made to obtain every study 
ever reported which met the broad 
substantive inclusion criteria described 
below. Principal sources included the 
Education Resources Information Center 
(ERIC), Dissertation Abstracts, and the 
reference lists of earlier reviews and of the 
primary studies themselves. Most of the 
studies located were doctoral dissertations. 
These were obtained from University 
Microfilms International or from the Library 
of Congress in Washington, DC, which 
maintains microfilm copies of all U.S. 
dissertations. In a few cases where 
unpublished documents could not be found 
or where clarifications were needed about 
important studies, authors were contacted 
directly. 

Substantive Inclusion Criteria 

Studies were included in an initial search if 
they identified themselves as evaluating 
nongraded, ungraded, multi-age, or 
Individually Guided Education programs in 



grades K-6. Studies spanning elementary 
and middle grades were included, but only 
data up to grade 6 were considered. 

Methodological Inclusion Criteria 

Studies were included if they met the 
following methodological criteria, which are 
identical to those applied in earlier reviews of 
ability grouping by Slavin (1987, 1990): 

1. Some objective measure of 
achievement was used. Because of their 
subjective nature, grades were not 
included as achievement variables. In 
practice, all achievement outcomes were 
assessed using standardized measures. 

2. Initial comparability of the nongraded 
and graded samples was established by 
means of random assignment of students, 
matching of schools or classes, or 
matching of individual students within 
classes or schools. In studies using 
matching, evidence had to be presented 
either to indicate that the groups were 
initially equivalent (within 20% of a 
standard deviation) or, if they were not 
equivalent, pretest data were presented to 
allow for adjustment of posttest scores 
for pretest differences. Studies which 
used gain scores or analyses of 
covariance to control for initial 
differences between nongraded and 
graded programs are listed in separate 
portions of each table, as adjustment for 
pretest differences cannot be assumed to 
completely control for their influence on 
posttests (see Reichart, 1979). Results of 
these studies should be interpreted 
cautiously. 

3. The nongraded program was in place 
for at least a semester. All studies located 
met this standard; in fact, only two 
studies were less than a year in duration. 
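
The equivalence rule in criterion 2 can be sketched as a simple check. This is a minimal illustration, not the reviewers' actual procedure: the function name is hypothetical, and using the graded group's standard deviation as the yardstick is an assumption, since the text does not say which standard deviation was applied.

```python
def initially_equivalent(mean_nongraded, mean_graded, sd_graded, tolerance=0.20):
    """Return True if the pretest gap is within `tolerance` standard deviations.

    Assumption: the gap is scaled by the graded (control) group's standard
    deviation; the review does not state which standard deviation was used.
    """
    if sd_graded <= 0:
        raise ValueError("standard deviation must be positive")
    gap_in_sd = abs(mean_nongraded - mean_graded) / sd_graded
    return gap_in_sd <= tolerance


# Hypothetical pretest means on a scale with a standard deviation of 15.
print(initially_equivalent(102.0, 100.0, 15.0))  # gap of about 0.13 SD
print(initially_equivalent(105.0, 100.0, 15.0))  # gap of about 0.33 SD
```

Under this sketch, a matched study failing the check would need pretest data so that posttest scores could be adjusted, as the criterion describes.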

Very few studies which used any 
achievement measure to compare nongraded 
and graded programs were excluded on the 
basis of these inclusion criteria. Examples of 
studies excluded are ones which involved 
nongraded secondary schools (e.g., 
Chalfant, 1972), studies without any 
evidence that nongraded and control groups 
were initially equivalent and without 
adjustments for pretests (e.g., Ingram, 
1960), and studies of school organization 
plans related to but not the same as the 
nongraded program (e.g., Heathers, 1967; 
Maresh, 1971). 

Studies were not excluded if they met the 
above criteria but failed to present data which 
would allow for computation of effect sizes. 
Instead, such studies were discussed in the 
review and were included in all tables with an 
indication of the direction and statistical 
significance of any differences (see below). 

Computation of Effect Sizes 

Whenever possible, effect sizes were 
computed as in earlier reviews of ability 
grouping by Slavin (1987, 1990). In 
general, effect size was computed as the 
difference between the nongraded and graded 
program's means divided by the graded 
program's standard deviation. When means 
or standard deviations were omitted, effect 



sizes were estimated from t's, F's, p's, or 
other statistics, using procedures described 
by Glass, McGaw, & Smith (1981). 
However, one important variation in the 
Glass et al. procedures was used when 
appropriate. If pretest scores were available, 
posttests were adjusted for them using 
ANCOVA's or gain scores. However, 
denominators in the effect size computations 
were always unadjusted individual-level 
posttest standard deviations. The purpose of 
these procedures is to avoid situations in 
which one treatment exceeded another at 
pretest and posttest to the same degree yet the 
posttest difference was coded as meaningful. 
See Slavin (1987) for more on this 
adjustment procedure and other details of 
effect size computation. 
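
The computation described above can be illustrated with a short sketch. It assumes the gain-score form of the pretest adjustment (ANCOVA is the other option mentioned) and takes the graded group's unadjusted posttest standard deviation as the denominator; the function names and the numbers are hypothetical and are not drawn from any study reviewed.

```python
def effect_size(mean_nongraded, mean_graded, sd_graded):
    """Difference between program means divided by the graded program's SD."""
    return (mean_nongraded - mean_graded) / sd_graded


def gain_adjusted_effect_size(pre_ng, post_ng, pre_g, post_g, sd_post_graded):
    """Effect size on gain scores, scaled by the unadjusted posttest SD.

    Assumption: adjustment is done with simple gain scores, and the
    denominator is the graded group's individual-level posttest SD.
    """
    gain_difference = (post_ng - pre_ng) - (post_g - pre_g)
    return gain_difference / sd_post_graded


# Hypothetical case: the nongraded group leads by 2 points at both
# pretest and posttest, so the two groups gained equally.
raw = effect_size(60.0, 58.0, 10.0)
adjusted = gain_adjusted_effect_size(52.0, 60.0, 50.0, 58.0, 10.0)
print(f"unadjusted ES = {raw:+.2f}, gain-adjusted ES = {adjusted:+.2f}")
```

In this hypothetical case the unadjusted posttest comparison would suggest a positive effect, while the adjusted value is zero, which is the situation the adjustment is meant to catch.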

For some purposes, effect sizes were pooled 
across studies. Whenever this was done, 
medians (not means) were computed on all 
studies from which effect size estimates could 
be derived. Pooling effect sizes within 
well-defined categories of studies can provide a 
useful summary of the size and direction of 
effects but the pooled estimate should always 
be evaluated in light of the quality and 
consistency of the individual studies 
narratively described in the text. 
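
A minimal sketch of the pooling rule follows. The effect sizes are hypothetical values chosen only to show why a median is less sensitive than a mean to a single extreme study; `None` stands for a study from which no effect size could be derived, and the function name is an illustrative assumption.

```python
from statistics import mean, median


def pooled_effect_size(effect_sizes):
    """Median of the available effect sizes; studies without one are skipped."""
    usable = [es for es in effect_sizes if es is not None]
    if not usable:
        return None
    return median(usable)


# Hypothetical category of five studies, one without a computable effect size.
studies = [0.25, 0.5, None, 0.75, 2.0]
usable = [es for es in studies if es is not None]
print("median:", pooled_effect_size(studies))  # 0.625
print("mean:  ", mean(usable))                 # 0.875
```

Here the one large value pulls the mean well above the median, which is why the pooled figure still has to be read against the individual studies.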



Categories of Nongraded Programs 



As noted earlier, nongraded elementary 
schools have varied widely in their 
particulars. This variation is not surprising, 
given that the original conception of the 
nongraded idea did not pretend to touch on all 
aspects of school organization and 
instruction: 

Nongrading is a scheme for 
organizing schools vertically. 
It does not account for the 
many problems of organizing 
schools horizontally (Goodlad 
& Anderson, 1963, p. 210). 

In looking at studies of nongraded elementary 
schools over time, an interesting pattern 
emerges. The earlier studies tended to apply 



nongrading to only one subject, usually 
reading. As time went on, studies began to 
include more than one subject, but to still 
maintain traditional curriculum and 
instruction; later still, nongraded programs 
began to incorporate much more radical 
changes in curriculum and instruction, along 
with increased use of team teaching, 
individualized instruction, learning stations, 
peer tutoring, cooperative learning, and so 
on. Individually Guided Education (IGE) 
represented a full flowering of this form of 
nongrading. 

It is possible to distinguish four distinct 
categories of nongraded programs, and this 
review considers each type separately. These 
are as follows. 
1. Nongraded Programs Involving Only One 
Subject (Joplin-Like Programs) 

Nine studies, all reported in the 1950's or 
'60's, evaluated nongraded plans which 
only involved one subject. The subject 
was reading in eight studies, math in one. 

2. Nongraded Programs Involving Multiple 
Subjects (Comprehensive Programs) 

Fourteen studies, reported from the late 
1950's or '60's to the early 1980's, 
evaluated nongraded plans incorporating 
two or more subjects (and often including 
all academic subjects). This category 
adheres most closely to the original 
conception put forward by Goodlad & 
Anderson (1963), in that the nongraded 
programs emphasize continuous progress 
and flexible, multiage grouping, but do 
not place a major emphasis on 
individualized instruction. 

3. Nongraded Programs Incorporating 
Individualized Instruction 

Eleven studies, all but one reported in the 
brief period of 1969-1973, evaluated 
nongraded programs which emphasized 
individualized instruction, learning 



stations, learning activity packages, 
programmed instruction, and/or tutoring. 

4. Individually Guided Education (IGE) 

Ten studies evaluated implementations of 
Individually Guided Education (IGE), 
described earlier. This was the latest 
group of studies, with reports appearing 
over the period 1972-1985. 

5. Studies Lacking an Explicit Description of 
the Nongraded Program 

In addition to the four categories 
discussed above, there were twelve 
studies that failed to state what was 
actually implemented in the nongraded 
programs they evaluated. These were 
generally ex post facto studies, often with 
large samples, in which the researchers 
simply accepted principals' words that 
their schools were nongraded. Given the 
considerable diversity among 
implementations that were described, it 
would be foolish to assume anything 
about what the independent variable in 
these studies really was. However, this 
category is included for the sake of 
completeness. 



Research on Nongraded Programs 



The following five sections discuss the 
research on each of the categories of 
nongraded programs described above. Each 
section contains a table that summarizes the 
major characteristics and findings of (a) 
randomized studies, then (b) matched 
equivalent studies, and finally (c) matched 
studies lacking evidence of initial equality, 
and within these categories the larger studies 
are listed first. The text usually follows the 
same order. In general, then, studies listed 
and discussed earlier in each section can be 
considered higher in methodological quality 
than those which come later. 

In each table, effect sizes are presented for 
each measure or subgroup used, and then an 
overall effect size is presented. Asterisks by 



effect sizes indicate that the differences were 
statistically significant, according to the 
authors. When effect sizes could not be 
computed, outcomes are characterized as 
favoring nongraded (+), no difference (0), or 
favoring graded (-), with asterisks if the 
differences were significant. A key to all 
symbols and abbreviations used in all tables 
appears in an appendix. 
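
The coding scheme for table entries can be sketched as follows. The function name and the exact output format are illustrative assumptions; the actual tables may lay out these symbols differently.

```python
def code_outcome(effect_size=None, direction=None, significant=False):
    """Format one table entry following the scheme described above.

    effect_size: numeric effect size, or None if it could not be computed.
    direction:   "+", "0", or "-" when only the direction is known.
    significant: True if the authors reported the difference as significant.
    """
    marker = "*" if significant else ""
    if effect_size is not None:
        sign = "+" if effect_size >= 0 else "-"
        body = f"{abs(effect_size):.2f}"
        if body.startswith("0"):
            body = body[1:]  # write +.72 rather than +0.72
        return f"{sign}{body}{marker}"
    if direction not in {"+", "0", "-"}:
        raise ValueError("direction must be '+', '0', or '-'")
    return f"{direction}{marker}"


print(code_outcome(effect_size=0.72, significant=True))  # +.72*
print(code_outcome(direction="0"))                       # 0
```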

Joplin-Like Nongraded Programs 

Table 1 summarizes the research on 
nongraded programs that have as a distinctive 
feature the homogeneous grouping of 
students according to performance level in 
only one subject. These plans can be labeled 
Joplin-like programs since they share with 
the Joplin Plan the idea of regrouping 
students for just one subject (usually 
reading), ignoring grade levels or ages. Nine 
studies, all done during the 1960's, are 
included in this category. Most of them were 
described under the Joplin Plan arrangement 
in an earlier synthesis of ability grouping and 
student achievement in elementary schools 
(Slavin, 1987). These studies appeared early 
in the nongraded movement, suggesting that 
the earlier implementations were more 
conservative (affecting only one subject) than 
those which appeared later. 

Results from five of the nine studies found 
strong positive effects for the nongraded 
plans, three studies reported no differences 
between them and graded plans, and one 
significantly favored the graded program. 



Table 1 Here 



Hillson, Jones, Moore, and Van Devender 
(1967) randomly assigned students and 
teachers to nongraded or traditional classes. 
Students in the nongraded classes were 
assigned to heterogeneous classes but 
regrouped across grade levels for reading. 
They proceeded through nine reading levels 
and were continually regrouped on the basis 
of their reading performance. Within each 
reading class teachers had multiple reading 
groups and used traditional basal readers and 
instructional methods. The results of this 
study supported the efficacy of the nongraded 
program. After three semesters, reading 
scores for experimental students on three 
standardized scales were considerably higher 
than for control students (ES = +.72, or 
about +.41 grade equivalents). After three 
years in the program, experimental-control 
differences had diminished but were still 
moderately positive (ES = +.33). 

Three studies compared Joplin-like 
nongraded classes to matched control classes 
and presented evidence of initial 
comparability. In the largest of these, 
Halliwell (1963) evaluated a nongraded 
primary that was virtually identical to the 
Joplin Plan. Students in first through third 
grades were regrouped for reading only, and 



remained in heterogeneous classes the rest of 
the day. Spelling was also included in the 
regrouped classes for second and third 
graders. The article is unclear as to whether 
within-class grouping was used in regrouped 
reading classes, but there is some indication 
that reading groups were not used. 

Results indicated considerably higher reading 
achievement in nongraded classes than in the 
same school the year before nongrading was 
introduced (ES = +.53). Scores were higher 
for nongraded students at every grade level, 
but by far the largest differences were for 
first graders, who exceeded earlier first grade 
classes by +.94 grade equivalents (ES = 
+1.22). 

It is important to note that mathematics 
achievement, measured at the second and 
third grade levels, also increased significantly 
more in the nongraded classes than in 
previous years (ES = +.51). Because 
mathematics was not part of the nongraded 
program, this finding suggests the possibility 
that factors other than the nongraded plan 
might account for the increases in student 
academic achievement. However, the author 
notes that teachers claimed to have been able 
to devote more time to mathematics because 
the nongraded program required less time for 
reading, spelling, and language instruction 
than they had spent on these subjects in 
previous years. 

A study by Skapski (1960) also evaluated the 
use of a Joplin-like nongraded organization 
for reading only. The details of the 
nongraded program were not clearly 
described, but it appears that reading groups 
were not used in regrouped classes and that 
curricula and teaching methods were 
traditional. Two comparisons were made. 

First, the reading scores of students in the 
nongraded program were compared to the 
same students' arithmetic scores, on the 
assumption that because arithmetic was not 
involved in the nongraded plan any 
differences would reflect an effect of 
nongrading. Results of this comparison 
indicated that second and third grade-aged 
students achieved an average of 1.1 grade 
equivalents higher in reading than in 
arithmetic. 

Second, scores of third graders who had 
spent 3 years in the nongraded program were 
compared to those of students in two control 
schools matched on IQ. Results indicated 
that the nongraded students achieved at a 
much higher level in reading than did control 
students (ES = +.57), but there were no 
differences in arithmetic (which was not 
involved in the nongraded program). 
Differences were particularly large for 
students with IQs of 125 or higher (ES = 
+.91), but were still quite substantial for 
students with IQs in the range 88-112 (ES = 
+.52). 

Only one study evaluated the use of a Joplin- 
like program in mathematics. This is a study 
by Hart (1962). Experimental students were 
regrouped for arithmetic instruction across 
grade lines and were taught as a whole class. 
Students were frequently assessed on 
arithmetic skills and reassigned to different 
classes if their performance indicated that a 
different level of instruction was needed. 
Experimental students who had spent 3 years 
in the nongraded arithmetic program were 
matched on IQ, age, and SES with students 
in similar schools using traditional methods. 
It is not stated whether control classes used 
within-class ability grouping for arithmetic 
instruction. Results indicated an advantage 
of about one half of a grade equivalent for the 
experimental group 
(ES = +.46). 

Five studies matched Joplin-like nongraded 
classes with graded ones, and dealt 
statistically with initial differences among 
students. In a study in Catholic schools in 
the Archdiocese of St. Louis, Bockrath 
(1958) analyzed the largest sample. She 
conducted three studies in one: first, a 
comparison of the fourth grade reading test 
scores of 1953 and 1956 in the 366 schools 
of the Archdiocese (12,450 students); 
second, the same comparison for a stratified 
sample of 50 of these schools (3,596 
students); and third, a three-year study of one 
of the Archdiocesan schools to examine how 
the nongraded primary functioned (106 
students). 



In 1956, students had been in the nongraded 
program for three years, while the 1953 
pupils had received graded instruction. 
Besides a one point difference in IQ, the 
students differed in entrance age. An effort 
was made by the author to adjust the fourth 
grade reading score medians in relation to IQ, 
but only a narrative description took into 
account the new entrance age adopted for the 
second group of students (a mean increase of 
two months for first year primary entrants in 
1953). The results clearly favored the 
nongraded plan (ES = +.51), which was 
characterized by the creation of flexible skill- 
level groupings. 

The only study of a Joplin-like program that 
found a clear advantage for a graded plan is 
presented by Moore (1963). First and 
second grade students' reading and arithmetic 
achievement scores were compared for 
nongraded and conventionally graded 
schools. Following comparable instructional 
practices, second grade students' reading 
scores in the graded and nongraded schools 
did not differ significantly (ES = -.12). 
However, a substantial difference favoring 
the graded program was found among first 
graders' reading scores (ES = -.70). As the 
author surveyed teachers concerning which 
reading materials their pupils were using, he 
found that first grade control students were 
reading approximately one basal reading text 
higher than the experimental group. 
Arithmetic scores did not differ for first and 
second grade students, but no description is 
given of the instruction in this subject. 

Enevoldsen (1961) did another study that 
found schools which differed in label rather 
than in organizational structure. The graded 
and nongraded schools chosen from the same 
public system had similar nongraded reading 
programs. Consequently, no significant 
difference was found in students' reading 
achievement. Two additional studies of 
relatively low quality, by Jacquette (1959) 
and Kierstead (1963), reported effect sizes 
that were close to zero. Both studies stated 
that learning levels established in the 
nongraded programs followed very closely 
the sequential pattern of graded reading 
skills, and as a result few differences in 
outcomes were found. 

Summary and discussion: Joplin-like 
nongraded programs. Overall, the findings 
of methodologically adequate studies of this 
type of nongraded program are consistent. 
All studies exhibiting good methodological 
quality (randomized and matched studies with 
evidence of initial equality) found substantial 
positive results in favor of the nongraded 
program. The median effect size for the four 
best-quality studies is +.50; for all studies 
from which effect sizes could be estimated, it 
is +.46. The matched studies lacking 
evidence of initial equality that do not report 
positive results are characterized by similar 
reading programs; the biggest difference 
between them appears to be their label. 

Two features are important in almost all of 
the successful nongraded programs 
evaluated. One is flexibility in pupil 
grouping, with frequent assessment of 
mastery at each level. The second is 
increased amounts of teaching time for the 
homogeneous instructional groups. Because 
each teacher had to manage fewer groups, 
there was less need for independent follow- 
up activities, such as worksheets in reading. 
Perhaps this last characteristic is one of the 
most important elements that favors students 
in a nongraded program: more homogeneous 
groups allow teachers to define more specific 
objectives for instruction and children receive 
a greater amount of direct teaching. 

Comprehensive Nongraded Programs 

Fourteen studies, summarized in Table 2, 
describe plans in which more than one 
subject was nongraded. These were 
conducted from the late 1960's to the 
beginning of the 1980's. Only three 
evaluations present small (and non- 
significant) negative total effect sizes, while 
eight of the ten that present results favoring 
the nongraded plans report statistically 
significant differences. 



Table 2 Here 



Among the eight studies with evidence of 
initial equality, Brody's (1970) is the one 
with the largest sample size. It evaluated a 
nongraded program in which first and second 
graders had to pass a series of sequential 
steps in several subjects at 90% mastery, and 
were placed in groups according to their 
mastery of specific skills (regardless of grade 
level). Vertical advancement of students was 
strongly emphasized. At the time of 
assessment, first graders had been in this 
program one year and second graders two 
years. Both groups of students gained 
significantly more than did students in graded 
classes (ES = +.20). Effects were 
particularly large in mathematics (ES = 
+.73). This study was somewhat flawed by 
the fact that before matching, the nongraded 
students were 5.4 points higher in IQ than 
their graded counterparts. 

The only matched equivalent study to find no 
differences in achievement between 
nongraded and graded programs was one 
done by Otto (1969), which took place in a 
laboratory school at the University of Texas. 
Unlike many other studies of nongraded 
plans, this study fully described the 
nongraded intervention, which was designed 
to be a full-scale implementation of the 
Goodlad and Anderson (1963) nongraded 
plan. 

Unfortunately, experimental and control 
groups did not differ on many elements held 
to be essential to the nongraded program. 
Teachers of the nongraded classes did assign 
students to instructional groups across grade 
lines, had students use materials suited to 
different levels, and provided less whole- 
class instruction than did teachers in the 
graded program. However, the nongraded 
classes did not use more subgroups than 
graded classes and did not reduce the 
heterogeneity of subgroups. Because the 
experiment took place in a laboratory school, 
it may be that control classes were of high 
quality and control teachers may have used 
many aspects of nongrading in their classes. 

It is interesting to note that two other studies 
conducted in university laboratory schools by 
Muck (1966) and Ross (1967) also failed to 
find differences between nongraded and 
graded classes. The principal differences 
between the graded and nongraded programs 
in the Muck study concern the sequence in 
which the curriculum was taught and the 
policy that children in the nongraded program 
remain with the same teacher for three years 
while three different teachers faced students 
in the graded plan. These are not key issues 
in the usual characterization of the nongraded 
approach. 

Perrin (1969) also found slight differences in 
favor of the nongraded plans (ES = +.11). As 
he analyzed the data at the end of each year in 
a three-year study, it became clear that as time 
passed, the results started to differ 
significantly. Perrin evaluated a nongraded 
program in which a minimum skills chart was 
used to trace the progress of each child. 
These basic skills in reading, language, and 
arithmetic were divided into levels, and 
children moved through them at their own 
paces. 

Buffie (1962) compared the scores children 
obtained during their last year of a graded or 
nongraded primary program. In the graded 
plan, pupils worked on the same program at 
the same time in all subjects except reading. 
In the nongraded plan, grouping within as 
well as between classrooms was done in 
reading, arithmetic, and spelling. Team 
teaching practiced in the nongraded schools 
also differentiated the plans. The results 
favor the nongraded plan (total ES = +.34), 
especially on the language subtest (ES = 
+.67). 

Another study that pointed out sharp 
differences in the instructional practices of the 
two groups compared was done by Guarino 
(1982). Using an index for nongradedness 
(Pavan, 1972), he tested the congruence 
between labels and structures. The main 
distinction was that grouping in the 
nongraded program was intended to provide 
an appropriate level of instruction for all 
students, and was guided by frequently 
administered diagnostic tests to discover 
deficiencies in skill areas. High and low 
achievers in the graded program had 
problems in receiving instruction appropriate 
to their special needs. The standardized 
scores differed significantly in favor of the 



nongraded program (total ES = +.34), 
especially in the reading subtests (ES = +.49) 
and for the older students. 

Ramayya (1972) reported positive results for 
the sixth year students in a nongraded school 
(total ES = +.42). For six years, these 
students attended reading and arithmetic 
classes that were reorganized into several 
levels. This study confirms the findings 
reported by Perrin, Brody, Buffie, and 
Guarino: the longer the duration of the 
nongraded program, the greater its favorable 
impact on student academic achievement. 

Among the matched studies lacking evidence 
of initial equality, the largest was a study by 
Zerby (1960). Instructional practices were 
similar in the two programs he compared. 
Reading and arithmetic texts differed between 
programs, and the nongraded program 
provided the students with the same teacher 
for all three years, while different teachers 
every year instructed children in the graded 
plan. Despite the resemblance in instructional 
practices, the results significantly favored the 
nongraded program (total ES = +.34), 
especially in arithmetic (ES = +.57). 

Lawson (1973) and Morris (1968) conducted 
studies that have several similarities. Reading 
and mathematics programs were organized by 
levels, and regrouping allowed teachers to 
teach classes of students all at one level in 
each subject. In the nongraded plans studied, 
team teaching was described by Lawson, 
while Morris emphasized the fact that 
teachers did not face more than two different 
ability groups per class. Both studies found 
positive results which increased with time. 
After three and five years, significant 
differences favored the nongraded programs. 

After only one year of intervention, Chastain 
(1961) evaluated the academic achievement of 
students in an intermediate school that shifted 
from a graded structure to achievement-level 
grouping in reading classes and finally to a 
nongraded plan. No differences were found 
in the reading achievement of students 
belonging to either plan; a negative difference 
in arithmetic achievement became smaller in 
the second year of homogeneous grouping 
(first year of the nongraded plan). 



Another study which evaluates what could be 
considered a comprehensive nongraded 
program was conducted by Gumpper (1971). 
Test scores from children attending the first 
four years of school in a continuous progress 
program and in a modified self-contained 
graded program were compared. Some 
ability grouping was used for mathematics 
and reading classes at the same grade level in 
the control school. Students changed classes 
several times during each day, breaking some 
of the atmosphere of the self-contained 
classroom situation. In the experimental 
school, children were grouped 
homogeneously according to achievement for 
language arts and mathematics. Since the 
nongraded program was introduced, 
problems with ability grouping occurred and 
teachers had to deal with as many as three 
different levels of children at the same time. 

The fact that the posttest was administered at 
the end of the first year of the nongraded 
program is Gumpper's main explanation for 
the positive results for first graders and the 
negative results for second, third, and fourth 
grade students. Rather than a true difference 
between graded and nongraded plans, the 
author believes that the lower achievement 
gains were more a function of problems of 
reorientation for older students in the 
continuous progress school. 

Summary and discussion: Comprehensive 
nongraded programs. Findings from this 
group of studies are consistently in favor of 
the nongraded program. Almost all of its 
positive results are significant; not one study 
found significant differences in favor of the 
graded plan. The median effect size for the 
matched equivalent studies was .34, and it 
was the same for all nine studies from which 
effect sizes could be estimated. 

Among those studies that did not report any 
significant difference, three were conducted 
in university laboratory schools, and another 
three found equivalence in the first year of the 
program but started to see favorable changes 
in subsequent years. In the case of 
laboratory schools, control classes were 
similar to experimental ones or they appeared 
to be very high quality classes. Perhaps for 



these reasons, significant differences did not 
appear in those circumstances. 

Across many studies, the duration of the 
program was correlated with higher positive 
differences. Other common characteristics in 
academically successful nongraded plans 
were subjects organized by levels, text 
written in accordance with those levels, and 
regrouping of students in multiage 
environments that allowed teachers to reduce 
the heterogeneity of their instructional 
groups. 

Nongraded Programs Incorporating 
Individualized Instruction 

Many studies of nongraded programs 
included indications that individualized 
instruction was an important part of the 
nongraded plan. These individualized 
approaches included one-to-one tutoring, 
programmed instruction, and learning activity 
packages. Two examples of the types of 
individualization adopted are as follows: 

Most students would be on 
contracts of work ... [which] might 
last from one to five days with the 
student coming to the teachers only 
in particular moments of difficulty. 
(Bowman, 1971, p. 46). 

The Individually Prescribed 
Instruction mathematics program ... 
was used in the model school. This 
individualized system of instruction 
provided each student with the 
opportunity to work on 
undeveloped skills, to obtain a 
diagnosis of new learnings, and to 
receive a prescription for the next 
sequence of material to be mastered. 
Math specialists, instructional aides, 
and volunteer aides were available 
to pupils on a one-to-one basis. 
(Jeffreys, 1970, p. 30) 

All but one of the eleven studies of this type 
were conducted in a brief period, 1969-1973, 
with a median of 1971. This is considerably 
later than the time frame in which the studies 
summarized in Table 1 appeared. The 
median publication date for the Joplin-like 



programs is 1962, and for the comprehensive 
(multiple subjects) programs, it is 1969. 
What this progression suggests is that 
individualized instruction increasingly 
became part of the nongraded elementary 
school in the late 1960's and early 1970's, at 
a time when individualization was gaining 
popularity in North American schools in 
general. 



Table 3 Here 



Table 3 summarizes the characteristics and 
findings of the 11 studies of nongraded 
programs including individualized 
instruction. Only one of these, conducted by 
Higgins (1980), randomly assigned students 
to nongraded or graded classes. Reading 
was the only subject of interest. The physical 
arrangements and instructional practices in 
both settings were essentially different. For 
the most part, instruction for children in the 
nongraded classrooms was individualized. 
The graded classes were grouped by ability 
and discussions took place in each group to 
check comprehension. The scores obtained 
by each group of students on the 
Metropolitan Reading Test did not show any 
significant difference (ES = +.02). 

Sie (1969) studied pupils from second, third, 
and fourth grades who were in two schools, 
one with a traditional graded plan and the 
other with a nongraded plan. The students 
were matched according to their SAT scores. 
Both schools shared similar group instruction 
in the areas of work study skills, social 
studies, and science. The principal difference 
between them was that the nongraded school 
emphasized individualized instruction in 
reading and arithmetic, while the graded one 
used some form of within-class grouping for 
reading. Out of 24 gain scores computed for 
the SAT subtests, the nongraded students 
scored significantly better in one, arithmetic 
computation, while the graded students 
performed significantly better on paragraph 
meaning and language subtests. The overall 
effect size was near zero. 

Jeffreys (1970) evaluated the academic 
achievement of children in a nongraded 
program characterized by an open space 
building design, team teaching, and 
individualized instruction, with that of 
children in a traditional graded plan. In the 
nongraded school, reading ability levels were 
used to group pupils for language arts, and 
skill levels were used to group them 
homogeneously in science. Students were 
grouped heterogeneously for social studies 
and health. In addition to spending more 
time in individualized and small group 
settings (math and spelling instruction 
followed an individualized system), 
nongraded students were found to initiate 
verbal interaction with teachers and to be 
involved in after-school activities a 
significantly greater number of times. 
However, no significant differences were 
found on the Iowa Test of Basic Skills. 

Another evaluation of a nongraded program 
operating in an open area and using team 
teaching was done by Wilt (1970). The 
author administered a teacher questionnaire to 
identify differences between both schools 
studied with respect to their internal structure, 
operating procedures, and teacher and student 
flexibility. Teachers in both schools 
supported the basic concept of individualized 
instruction, and it appears that those in the 
nongraded program used it somewhat more. 
There is no mention of its use in any specific 
subject; apparently it was used whenever the 
need arose. Consequently, criteria for 
grouping in the nongraded plan were more 
diverse (interest, academic achievement, 
student-teacher relationship) than in the 
graded program, where homogeneous 
grouping according to performance level 
prevailed. Despite the differences in vertical 
and horizontal organization, students' scores 
on the Iowa Test of Basic Skills did not differ 
significantly. 

Among the matched studies lacking evidence 
of initial equality, Ward (1969) investigated 
the largest sample. He compared the 
academic achievement of children in four 
different schools, two of them implementing 
nongraded programs and two following 
graded plans. Results favored the 
experimental group in each of the 72 
comparisons (although only 16 of these were 
statistically significant differences). The 
author notes that the nongraded schools 
differed mainly in the larger amount of time 
used by their teachers and pupils in reading 
and arithmetic. The nongraded approach 
exhibited a more flexible use of time and 
provided "the kind of 'atmosphere' which is 
conducive to the individualization of 
instruction" (Ward, 1969, p. 168). 

Burchyett (1972) found the largest 
differences in favor of the conventional 
school among the studies reporting use of 
individualized instruction (total ES = -.08). 
None of these, however, were statistically 
significant for all of the three grades studied. 
He compared children attending a nongraded, 
multiaged, team-taught school with children 
attending self-contained classrooms in a 
graded organization. Unfortunately, the 
author does not specify which areas of 
instruction were approached on an individual 
basis, which were characterized by multiage 
grouping, and how organizational patterns 
differed in the schools under study. 

One of the two studies in Table 3 reporting 
significant differences in favor of the 
nongraded program is a study by Bowman 
(1971). He compared pupils from first to 
sixth grade in a conventional graded school 
with students in a nongraded program from 
another school. The latter used 
individualized instruction, team teaching, 
flexible grouping, and learning centers. 
Individual work was emphasized in reading 
and mathematics: contracts for work on 
individualized units were agreed upon by 
teachers and pupils. Grouping across class 
and grade lines was the organizational 
arrangement for all other subjects, although 
curriculum changes were also undertaken for 
social studies, music, and art (science being 
an exception due to the lack of time to plan 
new units adaptable to a multiage situation). 
Strong positive effects on the academic 
achievement of intermediate students were 
found (ES = +.52), but the nongraded plan 
did not have similar results on the academic 
achievement of primary students (ES = 
+.06). 

Killough (1971) reports another study with 
significant positive effects of a nongraded 
plan implemented in an open-space school. 



In this study, children benefited from being 
in a nongraded program from first grade 
through the junior high school years. After 
three years in the program, pupils had 
significantly higher cognitive achievement 
gains than control students. Details of the 
intervention were not described. 

Walker (1973) also studied the long term 
effects of graded and nongraded primary 
programs. After rating the degree of 
nongradedness of each of the six programs 
he studied, his conclusions from comparing 
those that truly followed the nongraded 
principles with all the other plans resembled 
those of Bowman (1971). Scores began to 
differ significantly when children remained in 
a nongraded school after the primary unit. 
Walker found the greatest gap in academic 
achievement favoring these children at the 
fifth grade level. From his descriptions of 
the six programs studied, they differed 
mainly in the graded materials and 
terminology used in four schools and absent 
in two. Individualized instruction and 
grouping across grades was present in most 
schools, although with varying emphasis in 
reading and mathematics. 

A sharper contrast between self-contained 
classes and nongradedness with 
individualized instruction is evident in a study 
by Case (1970). It compared three 
conventional elementary schools to one 
nongraded middle school. The elementary 
schools used ability grouping and 
concentrated primarily on the development of 
reading and mathematics skills (instruction 
was given to smaller homogeneous groups), 
while pupils in the nongraded program were 
encouraged to do independent study in these 
subjects. The study reports no significant 
differences between the gain scores of each 
group of students (total ES = +.09). 

Givens (1972) evaluated fifth graders in two 
different schools. The demonstration school 
featured team-teaching, multiage grouping of 
pupils, an open-space concept, and 
individualization of instruction. Local 
universities contributed interns and student 
teachers to the schools' staff. No further 
description of the differences in instructional 
practices was provided. The standardized 
test scores did not favor either school 
(total ES = .00). An equal effect size was 
obtained in a small study conducted in the 
Snake River School District (1972). 

Summary and discussion: Nongraded 
programs incorporating individualized 
instruction. Considered together, the results 
of research on these nongraded programs are 
remarkably consistent. No significant 
differences appear in most studies. A median 
effect size of essentially zero (ES = +.02) 
was found across nine studies from which 
effect sizes could be computed. These 
findings suggest that nongraded programs 
using individualized instruction are equivalent 
to graded plans in terms of academic 
achievement. As the nongraded plans 
became more complicated in their grouping 
arrangements, they apparently lost the 
comparative advantage Joplin-like or 
comprehensive nongraded programs had. 

There is one interesting trend in the data on 
nongraded programs using individualized 
instruction: more positive effects were 
obtained with older than younger children. It 
may be that students need a certain level of 
maturity or self-organizational skills to profit 
from a continuous progress program which 
includes a good deal of independent work. 
Another indication of this is the observation 
that the longer the duration of the program, 
the better the results. 

Individually Guided Education 
(IGE) Programs 

Ten studies, most of which were done in the 
1970's, met the inclusion criteria for this 
review. Since individualized instruction is a 
characteristic of IGE schools, comparisons 
between them and non-IGE schools cast light 
upon the effects of programs that stress 
individualized instruction. 

Although the degree of implementation of 
IGE processes varied from one research 
setting to another, IGE concepts, 
components, and practices were clearly 
established by its developers at the University 
of Wisconsin (Klausmeier et al., 1971). As 
an ideal nongraded plan, the IGE program 
takes into account individual differences and 
uses specialized curriculum materials in 
reading, mathematics, and other subjects. 
But the IGE program is far more complicated 
than a usual nongraded program. Most 
reports do not provide any description of the 
type of intervention actually experienced by 
the experimental schools; it is implied that 
their organization follows the structure set in 
the implementation guidelines of 1971. 

Schneiderhan (1973) did the only randomized 
study of IGE programs. She compared two 
experimental programs, Individually Guided 
Instruction (IGI) and Individually Guided 
Education (IGE), with a traditionally graded 
program. Fourth through sixth grade 
students from one school were randomly 
selected to follow an IGI or a traditional 
program. Their ITBS scores and the scores 
from a third group of students in a second 
school with an IGE program were used in the 
comparison. Both IGE and IGI programs 
were characterized by nongradedness, team 
teaching, multiage grouping, differentiated 
staffing, and open-space environments. It 
can be inferred from the short descriptions 
provided that the only difference between 
them is the additional components an IGE 
program has: an Instructional Improvement 
Committee and a Systemwide Program 
Committee. 

There were no consistent significant 
differences found between the academic 
achievement of any of the three groups. It is 
possible that only one year in the IGI 
program after several years (at least three) in 
the conventional graded plan could not 
dramatically affect children's performance. 

Price (1977), in the largest matched study 
lacking evidence of initial comparability, 
started by measuring the level of 
implementation of the 35 processes 
emphasized in IGE schools. Then the author 
compared students' academic achievement in 
schools that were high implementers of the 
IGE philosophy with students' achievement 
in schools that were low implementers. 
Schools identified as high implementers, 
compared to low implementers, were 
associated with either higher or comparable 
pupil outcomes. Significant differences were 
found for reading achievement, but these 
cannot be correlated with specific classroom 
practices and organization since no 
description of them was provided. The 
findings of this study are biased by the fact 
that high implementers are probably better 
schools than low implementers, and this is 
likely to be related to student achievement 
differences. 

Five other studies showed no consistent 
differences among the schools compared. 
Except for Kuhlman's, none of them 
carefully describe the organizational 
arrangements adopted by each program. 
Biernacki (1976) evaluated an IGE multiunit 
program implemented by an inner-city 
elementary school. The academic 
achievement of selected students attending 
grades 3 through 6 from the multiunit school 
and from a self-contained graded school was 
compared, and no significant differences 
were found (total ES = +.17). Klaus (1981) 
tried not only to compare achievement of 
students in IGE and non-IGE schools, but to 
determine the effect the number of years 
spent by a student in each program had on his 
or her performance. His sample consisted of 
students who attended the same elementary 
schools and remained in the system through 
eleventh grade. The IGE program ended at 
the sixth grade level. Overall student change 
scores from fourth to sixth grade showed no 
significant effects of the program (total ES = 
+.05), and no significant differences were 
found in the achievement of eleventh grade 
students. This study reports larger 
differences favoring the scores of fourth 
graders in IGE schools. These differences 
become smaller in subsequent grades. 

In the only study that favors non-IGE 
schools, Flowers (1977) studied students' 
academic achievement in open space IGE 
schools and in traditional schools. He found 
no significant differences between their 
standardized test scores (total ES = -.25). 
Also, Henn's (1974) comparison of IGE and 
non-IGE programs did not yield significant 
differences in the academic achievement of 
students in either program (total ES = +.03). 



Kuhlman (1985) compared four types of 
school organization: conventional graded 
schools, traditional alternative schools (also 
graded), open alternative schools, and IGE 
schools. From the author's descriptions it is 
clear that the last two types of schools 
emphasized individualized instruction and 
used a nongraded approach. In regression 
analyses, the variable for school organization 
did not obtain a significant coefficient. 

Three other studies, besides the one 
conducted by Price, found significant 
differences in favor of IGE schools. 
Bradford (1972) studied an IGE school 
characterized by multilevel programs for 
reading, mathematics, and spelling, by an 
effective use of a multimedia approach and 
community resources for classroom 
enrichment, and by experiences with 
dramatization as a classroom technique. 
Preassessment tools were used to assemble 
students in small groups, independent study, 
or in a one-to-one relationship. Students' 
gains were close to statistical significance in 
reading, and significantly greater in 
mathematics in the IGE school than in self- 
contained classrooms. 

Burtley's (1974) comparison of IGE and 
traditional programs favored the former with 
respect to academic achievement (total ES = 
+.48). The author reports substantial 
differences in teachers' behavior and in 
instructional programs between the two plans 
compared. Teachers in the IGE school 
assumed more professional responsibilities 
and engaged in team teaching. Children in 
this school were exposed to several 
instructional modes: large or small multiage 
groups, one-to-one tutoring, independent 
study, and pairing. Another important 
difference was that an effort was made to 
avoid teaching subjects in isolation from each 
other; consequently, skills were constantly 
reinforced. 

Finally, a small study done by Soumokil 
(1977) reported the largest total effect size 
(+.80). As in the Price study, the research 
started by assessing the differences between 
an IGE elementary program in one school 
and a standard program in another. After 
confirming the character of each school's 
label, the author proceeded to compare 
students' standardized test scores. IGE 
students scored significantly higher. 

Summary and discussion: IGE programs. 
Overall, research findings on IGE schools 
resemble results obtained by other studies on 
nongraded programs incorporating 
individualized instruction (Table 3). The 
median effect size across six studies from 
which effect sizes could be computed was 
near zero (ES = +.11). Nevertheless, four 
studies reported significant differences in 
favor of IGE schools and they all are 
evaluations of schools which clearly differ 
from each other. It seems that schools which 
are closer to a full implementation of IGE 
concepts supply students with a wider range 
of instructional possibilities for their specific 
needs: small groups, one-to-one tutoring, or 
independent work. This finding supports the 
argument that selective use of individualized 
instruction can yield positive results for 
children's academic performance. 
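
The median reported in this summary can be checked against the six total effect sizes quoted for the IGE studies in this section. The following is a minimal sketch in Python; the names `ige_effect_sizes` and `median_effect_size` are illustrative assumptions, and the values are only the total ES figures cited in the text, not data taken from the original studies.

```python
from statistics import median

# Total effect sizes as quoted in the text for the six IGE studies
# from which an effect size could be computed.
ige_effect_sizes = {
    "Biernacki (1976)": 0.17,
    "Klaus (1981)": 0.05,
    "Flowers (1977)": -0.25,
    "Henn (1974)": 0.03,
    "Burtley (1974)": 0.48,
    "Soumokil (1977)": 0.80,
}


def median_effect_size(effect_sizes):
    """Return the median of the given effect sizes, rounded to two decimals."""
    return round(median(effect_sizes.values()), 2)


if __name__ == "__main__":
    n = len(ige_effect_sizes)
    print(f"Median ES across {n} studies: {median_effect_size(ige_effect_sizes):+.2f}")
```

With an even number of studies the median is the average of the two middle values (+.05 and +.17). The median is used here rather than the mean because it is less sensitive to a single large value such as the +.80 reported for the smallest study.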

Studies Lacking an Explicit 
Description of the Nongraded 
Program 

The last group of studies, summarized in 
Table 5, includes six studies dated in the 
1960's and six in the 1970's, all of which 
lack an explicit description of the type of 
intervention applied to experimental schools 
and the characteristics of control schools. 
Two doubts confront any reader of these 
reports: to what extent is the nongraded label 
a good description of practices in the 
experimental situations, and what 
characteristics do control schools have that 
really make them fit a conventional 
description. The value of these studies is 
perhaps in putting to rest the idea that simply 
giving a school an innovative label, in this 
case "nongraded," will have some effect on 
student learning. These studies are included 
for the sake of completeness, but little can be 
learned from them. 

In the largest matched-equivalent study, 
Hickey (1962) found that students in 
nongraded primaries in seven Catholic 
schools learned significantly more after three 
years than did students in similar graded 
schools (total ES = +.46). 

Among all other studies, only the smallest 
(Engel and Cooper, 1971) reported positive 
significant differences (total ES = +1.10). 
This study also differs from all others in its 
assurance that the schools compared belong 
to extremes in an index of nongradedness. 

Two medium-sized matched studies which 
lacked evidence of initial equality reported 
significant differences in favor of graded 
schools. Vogel and Bowers (1972) 
conducted a one-year study of students in 
kindergarten through sixth grade. Pupils in 
graded schools scored higher on the 
standardized test (SAT), but significantly 
lower on measures of conceptual maturity 
(according to the Harris Draw-A-Person 
Test). 

The second study that found significantly 
higher achievement in graded schools was 
also one of the lowest in methodological 
quality. Carbone (1961) compared the 
achievement of students in traditional graded 
schools to pupils' outcomes in schools 
mentioned by Goodlad and Anderson (1959) 
as nongraded, controlling for IQ scores. The 
students involved were in grades four, five, 
and six, thus they were one, two, or three 
years (respectively) beyond their experience 
in the nongraded primary. Further, there 
were substantial IQ differences between the 
two sets of students, and teacher 
questionnaires indicated very few differences 
between the two sets of teachers in reported 
classroom practices. 

Summary and discussion: Programs not 
adequately described. This is the only group 
of research studies in which a trend is not 
evident. The median effect size is near zero 
(ES = +.02), but two studies found 
significantly positive effects and two 
significantly negative ones. The most serious 
limitation of these studies is the lack of 
descriptions that could help to interpret the 
findings. A closer look at the four studies 
that present significant differences makes the 
argument in favor of the nongraded programs 
more convincing. Both positive studies had 
greater methodological quality: both had 
evidence of initial equality (students were 
matched on IQ), and Engel and Cooper even 
tested the validity of schools' labels. In 
contrast, each of the negative studies has 
some serious problems: inconsistency of 
findings or flaws in the experimental setting. 

Interactions With Study Features 

In addition to differentiating results according 
to categories of programs, assessments were 
made of the interaction of nongraded vs. 
graded organization with other features of the 
studies. One finding, which buttresses the 
main conclusions of this review, was that 
program effects for the Joplin-like and 
comprehensive models were particularly 
strong and consistent in the higher-quality 
studies, the randomized and matched- 
equivalent experiments. However, there 
were no consistent patterns with respect to 
effects at different grade levels (1-3 vs. 4-6) 
or subjects (reading, math, or language arts). 
Longer implementations (more than one year) 
were only inconsistently associated with 
larger effects. 

Program effects declined according to year of 
publication of the studies, but this is of 
course confounded with program types; most 
studies of the Joplin-like and comprehensive 
models without individualization were 
reported in the 1960's, while most studies of 
nongraded programs emphasizing 
individualization, including studies of IGE, were 
reported in the 1970's. 



Does Nongrading Accelerate or Decelerate Student Progress? 



One of the principal rationales for nongrading 
is that it allows students to spend more time 
in the grades involved, if necessary, until 
they can reach a high level of performance, or 
to spend less time if they are able to go more 
quickly than other students. Surprisingly, 
only one study ever actually assessed the 
degree to which nongraded students took 
non-normative amounts of time to complete 
the primary or elementary grades. This was a 
study by McLoughlin (1970), which 
compared students in graded and ungraded 
primary programs in eight New York State 
school districts. 

The nongraded programs used flexible 
cross-class and cross-age grouping, teaching to 
homogeneous groups and continuous 
progress curricula, and would therefore 
probably fall into the "comprehensive" 
category defined earlier. The comparisons 
were made in 1964-65 and again in 1965-66. 
In 1964-65, an average of 4.4% of students 
took an extra year to complete the primary 
grades in the nongraded schools, 4.6% in the 
graded ones. In 1965-66, 2.9% of 
nongraded students took an extra year, while 
7.3% of graded students had been retained. 
No students were accelerated in either type of 
program in 1964-65, and one tenth of one 
percent were accelerated in the nongraded 
schools in 1965-66. 

Put another way, 95.6% of the nongraded 
students made normative progress through 
the primary grades in 1964-65, and 97.0% in 
1965-66. What this means is that, at least in 
the time and places studied by McLoughlin, 
nongrading was not being used as a means of 
altering the amount of time students spent in 
the primary grades. On the contrary, in 
1965-66 more students were "decelerated" by 
retention in the graded schools than the 
number that took the opportunity to spend 
more time in the primary grades offered by 
the nongraded structure. 
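
The arithmetic behind these percentages can be laid out explicitly. The sketch below is illustrative only: the dictionary layout and the function name `normative_share` are assumptions, and the figures are the percentages quoted above from McLoughlin (1970). Normative progress is taken to be the share of students who neither took an extra year nor were accelerated.

```python
# Percentages of students, as quoted in the text from McLoughlin (1970).
nongraded = {
    "1964-65": {"extra_year": 4.4, "accelerated": 0.0},
    "1965-66": {"extra_year": 2.9, "accelerated": 0.1},
}
graded_retained = {"1964-65": 4.6, "1965-66": 7.3}


def normative_share(extra_year, accelerated):
    """Percent of students who neither took an extra year nor were accelerated."""
    return round(100.0 - extra_year - accelerated, 1)


if __name__ == "__main__":
    for year, pct in nongraded.items():
        share = normative_share(pct["extra_year"], pct["accelerated"])
        # Difference between graded retention and nongraded "extra year" rates.
        gap = round(graded_retained[year] - pct["extra_year"], 1)
        print(f"{year}: {share}% normative progress in nongraded schools; "
              f"graded retention exceeds nongraded extra-year rate by {gap} points")
```

The gap between graded retention and the nongraded extra-year rate is small in 1964-65 and considerably larger in 1965-66, which is the basis for the "decelerated" comparison in the paragraph above.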

McLoughlin (1970) also checked to see 
whether schools that had been implementing 
the nongraded plan over a longer period had 
more students who made non-normative 
progress than newer nongraded programs. 
There was a slight (nonsignificant) trend, but 
it was in the opposite direction. First-year 
nongraded programs had somewhat more 
students making non-normative progress than 
did schools which had implemented 
nongrading for two to seven years. 

If McLoughlin's findings apply to other 
nongraded programs, this has important 
methodological and substantive consequences. 
Methodologically, there is a concern in 
studies of nongrading that if nongraded 
students take more time to complete 
the primary grades, their test scores will be 
artificially increased. That is, if "third year" 
students in a nongraded school were older on 
average than third graders in a graded school, 
this could explain any test-score advantage of 
the nongraded programs. It would be important 
to know this is not the case. 

Substantively, McLoughlin's findings may 
be seen as questioning one assumption of 
many advocates of nongrading, who often 
paint a picture of the low-achieving child 
proceeding happily and successfully through 
the grades, never particularly aware that he or 
she is taking four years to accomplish what 
his or her classmates are completing in three. 
Yet students (and, more particularly, their 
parents) can count, and they know who their 
classmates were when they entered school. 
The pressures to have students make 
normative progress may be as strong in 
nongraded as in graded programs. 

Yet the main thrust of Goodlad and 
Anderson's rationale is not affected by 
McLoughlin's findings. It is still plausible 
that deceleration in continuous-progress 
curriculum is preferable to retention. 
Further, in a flexible nongraded program it 
may be that students who would otherwise 
fall behind can be identified and given extra 
assistance and then catch up with their peers. 
The nongraded plan might be seen not as a 
way to give low achievers more time, but 
rather as a way to use time and other 
resources more flexibly. A student who is 
not reading at the end of first grade might 
well be reading at the end of second if he or 
she receives extra help (and does not suffer 
the humiliation of repeating first grade). 



Discussion 



As the nongraded elementary plan re-appears 
in schools of the 1990's, it is important to 
learn about the history of this movement of 
thirty years ago. Most importantly, we need 
to understand the achievement effects of 
nongraded organization, and to understand 
the conditions under which achievement was 
or was not enhanced by this innovation. 

A review of research on the nongraded 
elementary school is particularly needed 
today because there was little consensus on 
its effects in its own time. Only two reviews 
examined portions of the literature, and they 
came to opposite conclusions. Pavan (1973, 
1977) concluded that the evidence favored the 
nongraded primary, while McLoughlin 
(1967) stated that most research showed no 
differences between graded and nongraded 
plans. 

Using the review methods they applied, the 
evidence presented here from a much larger 
set of studies could be interpreted as 
confirming both Pavan's and McLoughlin's 
contradictory conclusions. 



Table 6 Here 



Table 6 summarizes the outcomes of the 57 
studies which met the inclusion standards. 
Looking only at the "box score" of significant 
and non-significant positive and negative 
findings, the results can be read to support 
either McLoughlin's (1967) negative 
conclusion or Pavan's (1977) positive one. 
McLoughlin argued that since non-significant 
findings outnumbered significant positive 
ones, the effects of the nongraded primary 
were equivocal. Twenty-five years later, the 
proportions of significantly positive findings 
are like those he reported; only twenty of the 
57 studies were significantly positive. Pavan 
(1973, 1977) came to the opposite 
conclusion, noting that significant positive 
findings far outnumbered significant negative 
ones. This is also true in the present review; 
only three studies significantly favored 
graded programs, while twenty favored 
nongraded ones. 
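
The way a single box score supports both readings can be made concrete. The sketch below uses only the counts given above (57 studies, 20 significantly favoring nongraded programs, 3 significantly favoring graded programs); the two decision rules are simplified paraphrases of the arguments attributed to McLoughlin and Pavan, and the function names are hypothetical.

```python
# Box-score counts stated in the text.
TOTAL_STUDIES = 57
SIG_POSITIVE = 20   # significantly favored nongraded programs
SIG_NEGATIVE = 3    # significantly favored graded programs

# All remaining studies (no significant difference in either direction).
NOT_SIGNIFICANT = TOTAL_STUDIES - SIG_POSITIVE - SIG_NEGATIVE


def mcloughlin_style_reading(sig_pos, not_sig):
    """Simplified rule: effects are 'equivocal' if non-significant findings
    outnumber significant positive ones."""
    return "equivocal" if not_sig > sig_pos else "positive"


def pavan_style_reading(sig_pos, sig_neg):
    """Simplified rule: evidence is 'positive' if significant positive findings
    outnumber significant negative ones."""
    return "positive" if sig_pos > sig_neg else "equivocal"


if __name__ == "__main__":
    print("Non-significant studies:", NOT_SIGNIFICANT)
    print("McLoughlin-style reading:", mcloughlin_style_reading(SIG_POSITIVE, NOT_SIGNIFICANT))
    print("Pavan-style reading:", pavan_style_reading(SIG_POSITIVE, SIG_NEGATIVE))
```

The same three counts yield opposite verdicts depending on which comparison is made, which is why the review turns to median effect sizes by program type rather than relying on box scores.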

However, the conclusions of the present 
review conform to neither McLoughlin's nor 
Pavan's conclusions. Instead, the evidence 
presented here supports a conclusion that the 
effects of nongraded programs depend on the 
type of program being implemented. Using 
median effect sizes rather than box scores, it 
is clear that the effects of nongraded 
organization are strongest when the program 
focuses on the "vertical organization" of the 
school, when nongrading is used as a 
grouping strategy but not as a framework for 
individualized instruction. 

Four categories of nongraded programs were 
examined, plus one group of studies in which 
the nature of the nongraded program could 
not be determined. Studies in two of these 
categories clearly supported the nongraded 
plans. These are the Joplin-like programs, in 
which students are grouped across age lines 
in just one subject (usually reading), and the 
comprehensive programs, which involve 
cross-age grouping in many subjects but still 
rely on teacher-directed instruction. Not only 
were median effect sizes for studies in these 
categories clearly positive (+.46 for Joplin- 
like programs, +.34 for comprehensive), but 
the best-designed evaluations are the ones 
most likely to show the positive effects. 

In contrast, nongraded programs which 
incorporated a great deal of individualized 
instruction (and correspondingly less teacher- 
directed instruction), including Individually 
Guided Education (IGE), were less 
consistently associated with achievement 
gain. This is not to say that these approaches 
reduce student achievement; rather, their 
effects are very inconsistent, generally neither 
helping nor hurting student achievement, 
with more studies finding positive than 
negative effects (especially in the case of 
IGE). Poorly described nongraded programs 
also had median effects very near zero, 
perhaps because experimental and control 
groups may not have differed in anything 
essential except for label. 

What accounts for the relatively consistent 
positive effects of the Joplin-like and 
comprehensive nongraded plans and the less 
consistent effects of programs incorporating 
individualization? Recent developments in 
educational research suggest some 
possibilities. 

The most obvious reason that incorporating a 
great deal of individualization might have 
reduced the effectiveness of the nongraded 
elementary school is suggested by research 
on individualized instruction itself, which has 
generally failed to support this innovation 
(see, for example, Horak, 1981; Miller, 
1981; Bangert, Kulik, & Kulik, 1983; 
Rothrock, 1982). Correlational evidence 
from process-product studies of more and 
less effective teachers has consistently found 
that student learning is enhanced by direct 
instruction from teachers, as contrasted with 
extensive reliance on individualization, 
seatwork, and written materials to convey 
content to students (see Brophy & Good, 
1986). 

Further, to the degree that the nongraded 
elementary school came to resemble the open 
school, the research finding few achievement 
benefits of this approach takes on increased 
relevance (e.g., Giaconia & Hedges, 1982). 

In its simplest forms, the nongraded 
elementary school has many likely benefits. 
By grouping students across age lines, it may 
allow teachers to reduce the number of 
within-class reading and math groups they 
teach at any given time, thereby reducing the 
need for independent seatwork and follow- 
up. In fact, in several of the evaluations of 
Joplin-like programs, it was noted that 
cross-age groupings made within-class groupings 
(i.e., reading groups) unnecessary, so 
teachers could spend the class period teaching 
the entire class, with no need for seatwork 
unless they saw a specific need for it. 

Another factor in the success of simple 
nongraded plans is the likelihood that flexible 
cross-age grouping allows teachers to fully 
accommodate instruction to the needs of each 
child in a particular subject while still 
delivering instruction to groups. Goodlad 
and Anderson's (1959, 1963) critique of 
traditional ability grouping is that it does not 
truly reduce heterogeneity in the specific skill 
being taught. Grouping students within 
classes or within grades (in all but the largest 
elementary schools) does not provide enough 
opportunity to have group instruction closely 
tailored to student needs. Flexible cross-age 
grouping does provide such an opportunity, 
so the instructional costs of grouping (in 
terms of disruption, movement, and stigma 
for children in low groups) can perhaps be 
outweighed by the greater opportunity to 
adapt instruction to the precise needs of 
students and to continue to adapt to students' 
needs by examining and changing groupings 
at frequent intervals (see Slavin, 1987). 

If the effectiveness of nongraded organization 
is due to increased direct instruction delivered 
at each student's precise instructional level, 
then it is easy to see how a move to greater 
individualization would undermine these 
effects. Individualized instruction, learning 
stations, learning activity packets, and other 
individualized or small group activities reduce 
direct instruction time with little 
corresponding increase in appropriateness of 
instruction to individual needs (in comparison 
to the simpler cross-age grouping plans). 

It is difficult to assess the impact of one of 
the key rationales given for the nongraded 
plan throughout its history, the opportunity to 
allow at-risk students to take as much time as 
they need to complete the primary or 
elementary grades without the use of 
retention. An early study by McLoughlin 
(1970) found that self-described nongraded 
programs did not generally take advantage of 
the opportunity to let low achievers take more 
time, but we do not know if McLoughlin's 
findings would apply to most nongraded 
programs implemented now or in the past. 
Clearly, however, the effectiveness of the 
simpler nongraded programs does not depend 
on the opportunity to accelerate or decelerate 
student progress, since most studies found 
positive effects in the first year of 
implementation, before any acceleration or 
deceleration could take place. 



This discussion is completely speculative. 
There is much more we would have liked to 
know about how nongraded programs were 
actually implemented in the '50's, '60's, and 
'70's. The return of the nongraded idea in 
the 1990's may, however, allow us to 
answer many questions. We need not only 
assessments of current forms of nongrading, 
but also component analyses to understand 
which elements of nongrading account for the 
program's effects, and studies combining 
qualitative and quantitative methods to 
understand what really changes in nongraded 
schools and what differences these changes 
make in student achievement. 

Does Earlier Research on the 
Nongraded Elementary School 
Have Relevance Today? 

How much relevance does research on the 
nongraded elementary school have to 
education today? 

Many of the problems that the nongraded 
elementary school was designed to solve are 
still with us today, and the re-emergence of 
nongraded programs appears to be due in 
large part to concern about these problems, 
especially the tension between retention and 
social promotion and rejection of traditional 
forms of ability grouping. 

Yet there are also many differences between 
education today and that of thirty years ago. 
The general perception that both 
individualized instruction (e.g., Bangert et 
al., 1983; Horak, 1981) and the open 
classroom (e.g., Giaconia & Hedges, 1982) 
failed in their attempt to increase student 
achievement means that it is unlikely that the 
nongraded elementary schools of the 1990's 
will, like those of the early 1970's, embrace 
these methods. As a result, it is more likely 
that the nongraded programs of the 1990's will 
resemble the simpler forms found in this 
review to be instructionally effective. 

Other developments will certainly influence 
the forms taken by the nongraded programs, 
their effects on achievement, and their 
ultimate impact on educational practice. The 
movement toward "developmentally 
appropriate" early childhood education 
and its association with nongrading mean that 
the nongraded primary school of the 1990's 
will often incorporate four- and five-year- 
olds (earlier forms rarely did so), and that 
instruction in nongraded primary programs 
will probably be more integrated and 
thematic, and less academically structured or 
hierarchical, than in other schools. As in the 
early 1970's, the effectiveness of the 
nongraded school organization plan may 
become confounded with innovative 
instructional methods. Whether these 
instructional methods will have positive or 
negative effects on ultimate achievement is 
currently unknown. 

The ultimate impact of the nongraded ideal 
will also have much to do with rapidly 
unfolding changes in assessment and 
accountability. One reason for the increase in 
retention, pre-first, and other extra-year 
programs in the late 1970's and early 1980's 
was greatly increased accountability 
pressures in U.S. schools. Retaining more 
students has a strong (though short-lived) 
positive impact on achievement test scores 
reported by grade (not age), because the 
children taking the tests are older (see 
Allington & McGill-Franzen, 1991; Slavin & 
Madden, 1991). 

There is currently widespread concern about 
high retention rates (Shepard & Smith, 
1989), yet returning to social promotion 
would greatly reduce test scores in high- 
retention districts. If the nongraded 
elementary school emerges as a means of 
giving low achievers more time in the 
elementary grades, it may be favored by the 
current policies of reporting test scores by 
grade (for the same reasons that they favor 
retention). On the other hand, if high-stakes 
accountability systems begin to report 
achievement by age (as does, for example, the 
National Assessment of Educational 
Progress), this advantage may not become a 
factor. 

Clearly, there is a need for much more 
research on the nongraded elementary school 
as it is being implemented today. Because of 
scientific conventions of the time, most of the 
earlier research reviewed here was strong in 
experimental design (most studies used 
random assignment or careful matching of 
experimental and control groups) but weak in 
description of the independent variable, the 
characteristics of the nongraded and graded 
schools. Research done today must be 
strong on both dimensions. Goodlad and 
Anderson (1963) emphasize that the 
nongraded plan only addresses vertical 
organization, not instruction. Yet, as we 
have seen in this review, differences in 
instructional methods between nongraded and 
graded schools may account for differences 
(or non-differences) in outcomes. 

Research on the nongraded elementary school 
offers a fascinating glimpse into the history 
and ultimate fate of a compelling innovation. 
The return of this idea after nearly twenty 
years of dormancy is fascinating as well. 
This review concludes that the evidence from 
the first cycle of research on the nongraded 
elementary school supports use of simpler 
forms of the model and certainly supports the 
need for, and potential fruitfulness of, further 
experimentation. Yet there is a cautionary 
note in this tale as well. Good ideas can be 
undermined by complexification over time. 
We need a constant cycle of experimentation, 
research, evaluation, revision, and continued 
experimentation to build compelling ideas 
into comprehensive, effective plans for 
school organization and instruction. 

Appendix 

Abbreviations used in the tables 

+       Results clearly favor nongraded programs 
(+)     Results generally favor nongraded programs 
0       No trend in results 
(-)     Results generally favor graded programs 
-       Results clearly favor graded programs 
AG      Ability Grouping 
CAT     California Achievement Test 
CRT     California Reading Test 
CTBS    Canadian Test of Basic Skills 
CLTBS   California Test of Basic Skills 
DLRT    Durrell Listening-Reading Test 
GPRT    Gates Primary Reading Test 
IGE     Individually Guided Education 
IGI     Individually Guided Instruction 
ITBS    Iowa Test of Basic Skills 
G       Graded Program 
GE      Grade Equivalent 
KCT     Kansas Competency Test 
LCRT    Lee Clark Reading Test 
MAT     Metropolitan Achievement Test 
MRT     Metropolitan Reading Test 
NARA    Neale Analysis of Reading Ability 
NG      Nongraded Program 
OS      Open-area School 
OST     Ohio Survey Test 
PT      Piaget-type Test 
SAT     Stanford Achievement Test 
SRAAS   Science Research Associates Achievement Series 
STEP    Sequential Test of Educational Programs 



Table 1: Nongraded Programs Involving Only One Subject (Joplin-Like Programs) 



Article 



Grades 



Sample Size 



Duration of 

Program 



Design 



Test Effect Sizes Total 

by Achievement by Subject 



Hillson, Jones, 
Moon, and Van 
Devender (1967) 



2-3 Shamokin, 52 1.5 yrs. Students and teachers 

Pennsylvania (26 NG, 26 G) 3 yrs. randomly assigned to 

(1 school) (followup) NG/Joplin or 

heterogeneous graded 
classes for reading only. 



Matched Studies with Evidence of Initial Equality 
Halliwell (1963) 1-3 New Jersey 



Skapski (1960) 



Hart (1962) 



295 

(146 NG, 149 G) 
(1 school) 



1 yr. Compared NG/Joplin in 
reading and spelling to 
previous yr. 
heterogeneous grouping 
in the graded program. 
Students had comparable 
IQ at the beginning of the 
study. 

Burlington, 110 3 yrs. Students matched on IQ. 

Vermont (3 schools) Compared NG/Joplin in 

reading only to 
heterogeneous grouping 
in a graded program. 

Hillsboro, 100 3 yrs. Students matched on sex, 

Oregon (50 NG, 50 G) mental maturity, age, and 

(1 school) SES. Compared 

NG/Joplin in arithmetic 
only to heterogeneous 
grouping in a graded 
program. 



SAT. 
LCRT» 



CAT (gr 1), 
MAT (gr. 2-3) 



SAT 



CAT 



Su. +.91 
Hi. +.48 
Av. +.52 



Rdg. (1.5 yrs.) +.72 * +.33 
Rdg. (3 yrs.) +.33 



Rdg. +.53 * 



+.53 * 



Rdg. +.57 



Math +.46 * 



Matched Studies Lacking Evidence of Initial Equality 
Bockrath (1958) 4 



Archdiocese 
of St. Louis 



43 



(1974 NG, 1622 G) 3 yrs. 
(50 schools) 



Comparison between 
1956 students' reading 
achievement with 1953 
students' scores. IQ used 
to adjust score medians. 
Stratified sample by size 
and location of schools 



CAT 



111 t 

Iaj. ♦ 



kdg. fM 



♦ 51 






Table 1 Con't: 



Article 



Grades 



Matched Studies Lacking Evidence of Initial Equality 



Jacquette (1959) 



Moore (1963) 



1-6 



1-2 



Enevoldsen 
(1961) 



Kierstead (1963) 



13 



3 » 



Grand 

Junction, 

Colorado 



Wayne, 
Michigan 



Lincoln, 
Nebraska 



Orwell, 
Vermont 



Sample Size 



3517 
(1554 NG, 1963 G) 
(4 schools) 



621 

(292 NG, 329 G) 
(4 schools) 



420 

(210 NG, 210 G) 
(7 schools) 



277 

(111 NG, 166 G) 
(2 schools) 



Duration of 



5 yrs. 



Design Test 



1 yr. 



CAT, 
GPRT 



MAT 



Schools matched on rdg. 
achievement and IQ. 
Pretest used to compute 
gain scores. 

Schools matched on SES 
and curriculum. Change 
scores used to control 
pretest achievement 
significant differences. 
Compared NG/Joplin in 
reading only to 
conventional graded 
plans. 



2 yrs. (2 sch.) Schools matched on SES. CAT 

3 yrs. (1 sch.) IQ used as a covariate. 

Compared NG/Joplin in 
reading only to graded 
programs. 

1 yr. Students equated and ITBS 
classified by IQ and 
pretest. Pretest used to 
compute gain scores. 
Compared NG/Joplin in 
reading only to ability 
grouping in a graded plan. 



Effect Sizes 

by Achievement by Subject 



Rdg. +.03 



III. ..22 Rdg. .41 

Av. .,4.] 
U). .29 



Ml .01 (;(• 
Av f.OH (iB 
U. .14 GE 



Rdg. 0 



Rdg, 02 (il: 



Total 



t.Oi 



.4I** 



02 



Key: 



^ A key to the abbreviations used is provided as an appendix to this paper. 

* p<.05 

** p<.01 






Table 2: Nongraded Programs Involving Multiple Subjects (Comprehensive Programs) 



Article 



Grades 



Matched Studies with Evidence of Initial Equality 

Brody 1-2 Pennsylvania 

(1970) 



Otto (1969) 



Perrin 
(1969) 



Buffie 
(1962) 



Guarino 
(1982) 



Ramayya 

(1972) 



Muck 
(1966) 



Machiele 
(1965) 



35 



I 3 



2-5 



1-3 



Austin, Texas 



Little Rock, 
Arkansas 



Cedar Falls, 
Iowa 



New Jersey 



Dartmouth, Nova 
Scotia 



Buffalo, New 
York 



Urbana, Illinois 



Sample Size 



603 

(362 NG, 241 G) 
(3 schools) 

450 
(2 upper middle-class 
lab schools) 

288 

(144 NG, 144 G) 
(13 schools) 



234 
(117 NG, 117 G) 
(8 schools) 



162 
(81 NG, 81 G) 
(2 schools) 



160 
(80 NG, 80 G) 



148 
(1 lab school) 



100 
(50 NG, 50 G) 
(1 school) 



Duration of 



2 yrs. 
2 yrs. 



Design 



Students matched on IQ. 



Students matched on 
pretest achievement. 



Test Effect Sizes 

by Achievement by Subject 



Total 



3 yrs. Schools matched on 

SES. Students matched 
on IQ, age, sex, and 
race. 

3 yrs. Schools matched on 

SES, enrollment, class 
size, and teachers' 
experience. Students 
matched on sex, age, 
and intelligence. 

5 yrs. Schools matched on SES 

and ethnic mix. 
Students matched on 
age, sex, and IQ. 

6 yrs. Students matched on 

sex, IQ, and SES. 



3 yrs. Students matched on 
mental maturity. 

1 yr. Students matched on IQ, 
mental age, and 
chronological age. 
Compared students in NG 
program to students in 
previous yr. 



SAT Hi. + 
Lo. + 



MAT, 
ITBS 



MAT (1-2), 
ITBS (3) 



ITBS Hi. t )9 

Lu. t.l9 



CAT 



TBS 



MAT. 
ITBS 



CAT 



Rdg. +.20 
Math +.73 



Rdg. 0 
Math 0 



Rdg. +.08 
Math +.14 



Rdg. +.19 
Math +.17 
Lang. +.67 



Kdg. fA') 
Math +.19 



Rdg. +.41 * 

Math +.25 

Lang. +.59 * 

Rdg. +.04 

Math -.36 

Lang. +.15 

Rdg. +.61 

Math +.38 



+.46 



0 



+.11 



+.34 



+.42 



.06 



+.49 



■is 






Table 2 Con't: 



Article 



Grades 



Matched Studies Lacking Evidence of Initial Equality 



Zerby 
(1960) 



Morrisiown, 
Pennsylvania 



Sample Size 



394 

(187 NG, 207 G) 
(2 schools) 



Duration of 



3 yrs. 



Design 



Schools matched on 
SES. IQ score used to 
compute achievement 
beyond anticipated 
achievement level. 



Test Effect Sizes 

by Achievement by Subject 



CAT 



Rdg. +.10 
Math +.57 



Chastain 
(1961) 



Lawson 
(1973) 

Ross 
(1967) 



Morris 
(1968) 



Gumpper 
(1971) 



4 6 

1. 3. 5 
13 



13. 5 



1-4 



Rangely, 
Colorado 



Kokomo, 
Indiana 

Bloomington, 
Indiana 



Montgomery 

County, 

Pennsylvania 



Pennsylvania 



360 
(120 NG. 
120 NG. 120 G) 
(1 school) 

338 
(6 schools) 

314 

(128 NG. 186 G) 
(1 lab school) 



117 
(57 NG, 60 G) 
(1 school) 



(2 schools) 



1 yr. 



1. 3. A 5 

yrs. 

6 months 



3 yrs. 



1 yr- 



Key: 

^ A key to the abbreviations used is provided as an appendix to this paper. 
* p<.05 
** p<.01 



Students matched on sex MAT 
and IQ. Pretest used as a 
covariate. 



IQ used as a covariate. CAT 



Pretest and IQ used as MAT 
covariates. Students 
nonrandomly assigned 
to NG and G programs in 
the school. 

IQ used as a covariate. ITBS, 
Compared students in SAT 
NG program to students 
in previous year. 
Intervention stopped 
after 3 years. 

Schools matched on DLRT, 

SES, enrollment, SAT 

teachers' characteristics, 

and students' previous 

academic achievement. 

Pretest used to compute 

gain scores. 



Kdg. f.O! 
Math *.09 



Rdg. + 



Rdg. +.06 GE 
Math +.06 GE 



After 3 yrs. + * 
After 5 yrs. + 



gr. 1 Rdg. 0 
Math (+) 

gr. 2-4 Rdg. ( ) 
Math • 
Lang. ♦ 



Table 3: Nongraded Programs Incorporating Individualized Instruction 



Cocks 



Randomized Studies 

Higgins 
(1980) 



3-5 



Baton Rouge, 
Louisiana 



Matched Studies with Evidence of Initial Equality 
Sie (1969) 2-4 Ames, Iowa 



Jeffreys 
(1970) 



3.5 



Howard 
County, 
Maryland 



Will (1970) 



Chicago 
Suburb, 
Illinois 



Matched Studies Lacking Evidence of Initial Equality 



Ward(I9«i) 



1-2 



Fort Worth, 
Texas 



Sample Size 



246 

(75 NG, 171 G) 
(3 schools) 



124 

(67 NG, 67 G) 
(2 schools) 



88 

(44 NG, 44 G) 
(2 schools) 



84 

(32 NG, 52 G) 
(2 schools) 



797 

(376 NG, 421 G) 
(4 schools) 



Duration of 
Program 



1 yr. 



Design 



1 yr. 



1 yr. 



4 yrs. 



2 yrs. 



Students randomly 
assigned to 

nongraded/combination or 
traditional reading classes. 
Pretest used to compute 
gain scores. 



Schools matched on SES. 
Students matched on SAT 
scores. Pretest used to 
compute gain scores. 

Schools matched on SES. 
Students matched on 
pretest achievement 
measure. Pretest scores 
and parent occup. status 
used as covariates. 

Students matched on IQ 
and age. 



Schools matched on SES. 
race, and available 
resources. IQ, age, and 
readiness scores used as 
covariates. 



Test 



Effect Sizes 
bv Achievement by Subject 



MRT 



Hi. (+) 
Lo. (+) 



SAT 



ITBS 



ITBS 



MAT 



Rdg. +.02 



Rdg. +.03 
Math +.14 
Lang. -.11 



Kdg. +.0« 
Math -.13 



Rdg. +.49 GE * 
Math +.10 GE 
Lang. -.27 GE 



Rdg. + 
Math (+) 



Total 



+.02 



+ .02 



-.03 



+.11 GE 






Burchyett 
(1972) 



3-5 



Grand Blanc, 
Michigan 



535 

(332 NG, 203 G) 
(2 schools) 



2 yrs. Schools matched on SES. STEP 
Pretest used as a covariate. 



Rdg. -.(X) 
Math .10 



O 4. 



m 



Table 3 Con't: 



Sample Size 



Matched Studies Lacking Evidence of Initial Equality 



Bowman 
(1971) 



Case (1970) 



1-6 



Burlington, 



Montgomery 

County, 

Maryland 



457 



North Carolina (313 NG, 144 G) 
(2 schools) 



269 

(131 NG, 138 G) 
(4 schools) 



Killough 
(1971) 



1-8 



Houston, Texas 



267 

(132 NG, 135 G) 
(4 schools) 



Givens 
(1972) 



Saint Louis, 
Missouri 



100 

(50 NG, 50 G) 
(1 lab, 1 control 
school) 



Walker 
(1973) 



1 12 



Kentucky 



96 

(32 NG, 64 G) 



Duration of Design Test Effect Sizes Total 

Program by Achievement by Subject 



1 yr. 


IQ used as a covariate. 
Pretest used to compute 
gain scores. 


MAT 




Rdg. 

Math 


+.27 * 
+.28 * 


+.28 * 


1 yr. 


Schools matched on SES. 
Students matched on age, 
sex, race, and SES (higher 
IQ scores for control 
group). Pretest used to 
compute gain scores. 


SAT 


Hi. +.18 
Av. +.14 
Lo. -.01 


Rdg. 
Math 


+.01 
+.16 


+.09 


3 yrs. 


Schools matched on SES 
and ethnic distribution. 
IQ used as a covariate. 
Pretest used to compute 
gain scores. 


SRAAS 




Math 


+ * 
+ * 






Students randomly selected 
from two populations of 
students that received 
either individualized or 
traditional instruction. Ex 
post facto experimental 
design. Pretest used to 
compute gain scores. 


ITBS 


Hi. (+) 
Av. 0 
Lo. (+) 


Rdg. 
Math 


• U 
+.10 


.00 


12 yrs. 


Schools rated on an eight 
dimension scale, the 
Nongradedness 
Assessment Scale. 
Longitudinal study to 
determine the long term 
effects of NG and G 
primary school years 
(1-3). Rate of progress used 
as a covariate. 






Rdg. 
Math 
Lang. 


+.24 
+.14 
+.17 


+.18 



5 » 



Table 3 Con't: 



Article Grades Location Sample Size 



Matched Studies Lacking Evidence of Initial Equality 

Snake River 1-3 Blackfoot, 78 

School Idaho (39 NG, 39 G) 

District (2 schools) 

(1972) 



Key: 

^ A key to the abbreviations used is provided as an appendix to this paper. 
* p<.05 
** p<.01 



Duration of Design Test Effect Sizes 

Program by Achievement by Subject 



1 yr. Students matched on SES. SRAAS Rdg. .00 .00 

Pretest used to compute Math .00 

gain scores. 



5{] 



Table 4: Individually Guided Education (IGE) 



4> 



Quisle 



Schneiderhan 
(1973) 



Location 



Sample Size 



4-6 



Roseville, 
Minnesota 



484 

(206 IGE, 88 IGI, 190 G) 
(2 schools) 



Matched Studies Lacking Evidence of Initial Equality 
Price (1977) 4,6 Iowa 



Biernacki 
(1976) 



3^ 



Klaus (1981) 



Bradford (1972) 



4-6 



1-3 



Pi? 



ERIC 



Toledo, Ohio 



Michigan 



1081 
(637 hi, 444 lo) 
(14 schools) 



479 

(174 NG, 305 G) 
(2 schools) 



433 

(219 NG, 214 G) 



394 

(2WN(i.93 Ci) 
(2 schools) 



Duration of 



Design 



1 yr. Students randomly 

selected to individually 
guided instruction or 
traditional programs in 
the same school. IQ and 
pretest scores used as 
covariates. 



3 yrs. Comparison of high and 
low implementers of 35 
processes employed in 
IGE. Schools matched 
on size, SES, and 
location. IQ used as a 
covariate. 

6 months Schools matched on 
SES, race, and similar 
achievement in grade 
equivalents in reading 
and math for students in 
graded. Students 
randomly selected from 
chosen schools. Pretest 
used to compute gain 
scores. 

3 yrs. Pretest used to compute 
gain scores. IQ used as 
a covariate. 

1 yr. Students matched on 
sex, SES, and reading 
and math achievement. 
Pretest used to compute 
gain scores. IQ used as 
a covariate. 



Test 



ITBS 



Effect Sizes 

by Achievement by Subject 



(IGI and control) Rdg. 0 
Math 0 
Lang. 0 

(IGE & IGI and Rdg. (-) 
control) Math 0 
Lang. (+) 



ITBS 



MAT 



ITBS, 

SRAAS 



MAT 



Rdg. + * 
MaOi (^) 



Rdg. +.13 
Math +.20 



Math +.12 

Lang. -.05 

Rdg. + 

Math + * 



Total 



0 



+ .17 



+ .().S 



+ ♦ 



Table 4 Con't: 



Article 



Sample Size 



Matched Studies Lacking Evidence of Initial Equality 

Bunlcy(1974) 2-3 Woodberry. 

Illinois 



302 

(167 NG, 135 G) 



Flowers (1977) 



3 Westminster, 221 

Colorado (99 NG, 122 G) 



Kuhlman 
(1985) 



2, 4, 6 Kansas 



200 

(50 OS, 50 IGE, 100 G) 



Soumokil 
(1977) 

Henn(1974) 



3.5 



Columbia, 
Missouri 

Ohio 



102 
(2 schools) 

(24 schools) 
(enrollment; 7 072 NG, 
6958 G) 



Key: 

^ A key to the abbreviations used is provided as an appendix to this paper. 

* p<.05 
** p<.01 



Duration of 



Design 



Test Effect Sizes 

by Achievement by Subject 



2 yrs. Schools matched on MAT Rdg. +.40 

SES, ethnicity, size, and Math +.53 ' 

enrollment. Pretest used 
to compute gain scores. 

3 yrs. Best school matches SAT Rdg. -.25 

available among the Lang. -.25 

remaining schools 
within the district based 
on SES. Students 
classified by SES. 

2 yrs. Students randomly KCT Rdg. 0 

selected from chosen Math 0 

schools. SES and 
number-of-parents used 
as covariates. 

2 yrs. (gr. 3) Pretest and IQ used as ITBS Rdg. +.7y * 

3 yrs. (gr. 5) covariates. Math +.K() * 

2 yrs. Schools matched on MAT, Rdg. +.05 

SES, available OST Math +.01 

resources, and teachers' 
qualifications. 



Gu 



Table 5: 






Article 



Location 



Matched Studies with Evidence of Initial Equality 



Hickey (1962) 



Diocese of 
Pittsburgh 



Lair (1975) 



Richardson, 
Texas 



4S 



Aigner (1961) 



Mycock (1966) 



Bellevue, 
Washington 



Manchester, 
England 



Reid (1973) 



Williams 
(1966) 



4 

3 



Alabama 



Hammond, 
Indiana 



Engel and 
Cooper (1971) 



Dartmouth, 
Nova Scotia 



til 



Sample Size 


Duration of 


Design 


Test 


Effect Sizes 
by Achievement by Student 


Total 


1348 
(745 NG, 603 G) 
(14 schools) 


3 yrs. 


Schools matched on 
SES. Students matched 
on IQ. 


MAT 


Hi. +.31 
Av. +.18 
Lo. -.01 


• Rdi! + 24 ♦ 
Math +.68 


T.*#0 


463 

(183 NG,280G) 
(12 schools) 


3 yrs. 


Random selection of 6 G 
and 2 NG schools. 
Students matched on 
readiness for learning 
scores. 


CLTBS 


Su. -.12 
Hi. -.21 
Av. -.36 


Lang. -.09 


(Ml 


428 

(214 NG, 214 G) 


3 yrs. 


Groups equated with the 
School and College 
Abilities Total test. 


STEP 


Hi. 0 
Av. 0 
Lo. 0 


Rdg. (-) 
Math (-) 


(-) 


108 
(4 schools) 


1 yr. 


Schools matched on 
size, resources, and staff 
ratio and quality. 
Students matched on 
age, sex, and 
intelligence. 


NARA, 
PT 




Rdg. 0 
Math 0 


0 


100 

(50 NG.50 G) 


3 yrs. 


Students matched on 
age, sex, and mental 
ability. 


SAT 


Hi. -.11 
Av. -.01 
Lo. -.05 


Rdg. +.01 
Math -.05 
Lang. +.01 


-.01 


76 

(38 NG, 38 G) 


3 yrs. 


Students matched on 
age, sex, and IQ. 


SAT 


Hi. +1.29 
Lo. -1.30' 


• Rdg. -.46 • 
Math -.23 


-.34 


40 

(20 NG, 20 G) 
(2 schools) 


6 yrs. 


Schools selected 
according to an index for 
nongradedness. Students 
matched on IQ. 


CAT 




Rdg. +1.20 
Lang. +1.02 


+ 1.10 



K3 



Table 5 Con't: 

Article Grades Location Sample Size 



Matched Studies Lacking Evidence of Initial Equality 

6 



Herrington 
(1973) 



Dade County, 
Florida 



951 
(16 Schools) 



Vogel and 
Bowers (1972) 



K-6 



Evanston, 
Illinois 



473 

(224 NG, 249 G) 



Hopkins, 
Oldridge, & 
Williamson 
(1965) 

Carbone (1961) 



3-4 



4-6 



Los Angeles 
County, 
California 



330 

(139 NG. 191 G) 
(4 schools) 



244 

(122 NG, 122 G) 
(6 schools) 



Remacle (1970) 



5-6 



Brookings, 
South Dakota 



128 

(64 NG, 64 G) 
(1 school) 



Key: 

^ A key to the abbreviations used is provided as an appendix to this paper. 
* p<.05 
** p<.01 



Duration of 



Design 



Test Effect Sizes Total 

by Achievement by Student 



yr. Schools randomly 
selected from SES 
ranked lists. Classes 
randomly selected. 
Pretest used as covariate. 



SAT 



Rilg. (^) M) 
Math -i 



1 yr. Teachers matched on 

sex, training, experience, 
and age level taught. 
Pretest used as covariate. 



SAT 



composite 



3 yrs. IQ used as covariate. 



CRT 



Rdg. ^.02 



^.02 



At least 3 Schools matched on H US Rdg. - 

yrs. SES. Classes randomly Math - 



mm 



selected. Students Lang, 
matched on sex and age. 
IQ used as covariate 

2 yrs Random selection of ITBS Rdg. ^.24 (ili 31 Gh 

(gr. 5) students in control Math ^.VlCAi 

1 yr. group. IQusedas Lang. ^J.KjL 

(gr. 6) covariate. 



• 



64 



Table 6 

Summary of Effects by Type of Nongraded Plan 

Type of Program   Total   Significant   Positive   No           Negative   Significant   Median ES^ 
                          Positive                 Difference              Negative 

Joplin-Like         9         4            2          1            1           1         +.46 (7) 
Comprehensive      14         8            2          1            3           0         +.34 (9) 
Individualized     12         2            6          2            2           0         +.02 (9) 
IGE                10         4            3          2            1           0         +.11 (6) 
Unspecified        12         2            3          1            4           2         +.01 (6) 

^ Number of studies in which an effect size could be computed is presented in parentheses. 
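
As a rough consistency check on Table 6, the outcome counts in each row can be summed and compared with the row total. The Python sketch below assumes the outcome columns read, in order, significant positive, positive, no difference, negative, and significant negative; that column order and the names `ROWS`, `counts_match_total`, and `share_significant_positive` are illustrative assumptions, not part of the original review.

```python
# Counts transcribed from Table 6. Assumed column order (not stated legibly
# in the source): total, significant positive, positive, no difference,
# negative, significant negative.
ROWS = {
    "Joplin-like":    (9, 4, 2, 1, 1, 1),
    "Comprehensive":  (14, 8, 2, 1, 3, 0),
    "Individualized": (12, 2, 6, 2, 2, 0),
    "IGE":            (10, 4, 3, 2, 1, 0),
    "Unspecified":    (12, 2, 3, 1, 4, 2),
}


def counts_match_total(row):
    """Return True when the five outcome counts add up to the row total."""
    total, *outcomes = row
    return sum(outcomes) == total


def share_significant_positive(row):
    """Fraction of a row's studies that reported a significant positive effect."""
    total, sig_pos, *_ = row
    return sig_pos / total


if __name__ == "__main__":
    for name, row in ROWS.items():
        print(f"{name:15s} consistent={counts_match_total(row)} "
              f"sig. positive share={share_significant_positive(row):.2f}")
```

Under these assumptions every row is internally consistent, and the share of significantly positive studies is about 0.44 for Joplin-like and 0.57 for comprehensive programs, against about 0.17 for individualized programs, which matches the direction of the median effect sizes reported in the table.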
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